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In another embodimem. the invention provides isolated PROl 153 polypeptide encoded by any of the 
isolated nucleic acid sequences hereinabove defined. 

hi a specific aspect, the invention provides isolated native sequence PRO 1 153 polypepude. which in one 
embodimem, includes an amino acid sequence comprising residues 1 to 197 of Figure 246 (SEQ ID NO:351). 

In another aspect, the inveniion concerns an isolated PRO 1 153 polypeptide, comprising an amino acid 
5 sequence having at least about 80% sequence identity, preferably at least about 85 % sequence identity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
sequence of amino acid residues 1 to about 197, inclusive of Figure 246 (SEQ ID NO:351). 

In a further aspect, the invention concerns an isolated PROl 153 polypeptide, comprising an amino acid 
sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
10 about 90% positives, most preferably at least about 95% posnives when compared with the amino acid sequence 
of residues 1 through 197 of Figure 246 (SEQ ID NO:351). 

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 
molecule under stringent conditions with (a) a DNA molecule encoding a PROl 153 polypeptide having the 
sequence of amino acid residues from about 1 to about 197. inclusive of Figure 246 (SEQ ID NO:351), or (b) 
15 the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 
identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 
identity, most preferably at least about a 95% sequence identity to (a) or (b). (ii) culturing a host cell comprising 
the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the 
polypeptide from the cell culture. 

20 

106. PR01154 

A cDNA clone (DNA59846-1503) has been identified that encodes a novel aminopeptidase, designated 
in the present application as "PROl 154." 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
25 a PROl 154 polypeptide. 

In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PROl 154 polypeptide having 
the sequence of amino acid residues from about 1 or 35 to about 94 1 , inclusive of Figure 248 (SEQ ID NO:353). 
30 or (b) the complement of the DNA molecule of (a). 

In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROl 154 
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 86 or 
188 and about 2908, inclusive, of Figure 247 (SEQ ID NO:35 2). Preferably, hybridization occurs under 
stringent hybridization and wash conditions. 
35 In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 

at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
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encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposh No. 209978 
(DNA59846-1503), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 
acid comprises a DNA encoding the same mawre polypeptide encoded by the human protein cDNA in ATCC 
Deposit No. 209978 (DNA59846-1503). 

In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 

5 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, mosi preferably at least about 95% sequence 
identity to the sequence of amino acid residues from about 1 or 35 to about 94 1 . inclusive of Figure 248 (SEQ 
ID NO:353), or the complement of the DNA of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule produced by hybridizing 

10 a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PR01154 polypeptide 
having the sequence of amino acid residues from about 1 or 35 to about 941 , inclusive of Figure 258 (SEQ ID 
NO:353). or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 
80 % sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% 
sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA 

IS molecule. 

In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 1 or 35 to about 941. inclusive of Figure 248 (SEQ ID NO:353), or (b) the 
20 complement of the DNA of (a). 

In another aspect, the invention concerns an isolated nucleic acid molecule consisting essentially of DNA 
encoding a polypeptide having amino acids 1 or 35 through about 73 of SEQ ID NO:353. 

In another embodiment, the invention provides isolated PROl 154 polypeptide encoded by any of the 
isolated nucleic acid sequences hereinabove defined. 
25 In a specific aspect, the invention provides isolated native sequence PRO 1 154 polypeptide, which in one 

embodiment, includes an amino acid sequence comprising residues I or 35 to 941 of Figure 248 (SEQ ID 
NO:353). 

In a specific aspect, the invention provides a polypeptide having amino acids I or 35 through about 73 
of SEQ ID NO:353. 

30 In another aspect, the invention concerns an isolated PROl 154 polypeptide, comprising an amino acid 

sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
sequence of amino acid residues 1 or 35 to about 941, inclusive of Figure 248 (SEQ ID NO:353), 

In a further aspect, the invention concerns an isolated PROl 154 polypeptide, comprising an amino acid 

35 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence 
of residues 1 or 35 through 941 of Figure 248 (SEQ ID NO:353). 
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In yei another aspect, the invention concerns an isolated PRO 1 154 polypeptide , comprising the sequence 
of amino acid residues 1 or 35 to abou! 941. inclusive of Figure 248 (SEQ ID NO:353). or a fragment thereof 
sufficiem to provide a binding site for an anii-PROl 154 antibody. Preferably, the PROl 154 fragment retains 
a qualitative biological activity of a native PROl 154 polypeptide. 

In a still further aspect, ihe invention provides a polypeptide produced by (i) hybridizing a test DNA 
5 molecule under stringent conditions with (a) a DNA molecule encoding a PROl 154 polypeptide having the 
sequence of amino acid residues from about 1 or 35 to about 94 1 . inclusive of Figure 248 (SEQ ID NO:353), 
or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% 
sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% 
sequence identity, most preferably at least abcwt a 95% sequence identity to (a) or (b), (ii) culturing a host cell 
10 comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) 
recovermg the polypeptide from the cell culture. 

In yet another embodiment, the invention concerns agonists and antagonists of the a native PROl 154 
polypeptide. In a panicular embodiment, the agonist or antagonist is an anii-PROl 154 antibody. 

In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 
15 native PROl 154 polypeptide, by contacting the native PROl 154 polypeptide with a candidate molecule and 
monitoring a biological activity mediated by said polypeptide. 

In a still further embodiment, the invention concerns a composition comprising a PRO 1 154 polypeptide, 
or an agonist or antagonist as hereinabove defmed. in combination with a pharmaceutically acceptable carrier. 

20 107. PROl 181 

A cDNA clone (DNA59847-1511) has been identified that encodes a novel secreted polypeptide, 
designated in the present application as "PROl 181". 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PROl 181 polypeptide. 

25 In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 

preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95 % sequence idemity to (a) a DNA molecule encoding a PROl 181 polypeptide having 
the sequence of amino acid residues from about 1 or about 16 to about 437. inclusive of Figure 250 (SEQ ID 
NO:355). or (b) the complement of the DNA molecule of (a). 

30 In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROl 181 

polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 17 
or about 62 and about 1327. inclusive, of Figure 249 (SEQ ID NO:354). Preferably , hybridization occurs under 
stringent hybridization and wash conditions. 

In a further aspect, die invention concerns an isolated nucleic acid molecule comprising DNA having 

35 at least about 80% sequence idemity. preferably at least about 85 % sequence identity, more preferably at least 
about 90% sequence idemity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203098 
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(DNA59847- 15 1 1 ) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the 
nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in 
ATCC Deposit No. 203098 (DNA59847-1511). 

In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least about 80% sequence idemity. preferably at least about 85% sequence 
idenuty, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
identity to the sequence of amino acid residues 1 or about 16 to about 437, inclusive of Figure 250 (SEQ ID 
NO:355). or (b) the complement of the DNA of (a). 

In a further aspect, the invention concerns an isolaied nucleic acid molecule having ai least 10 
nucIetMides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA 
molecule encoding a PROl 181 polypepUde having the sequence of amino acid residues from 1 or about 16 to 
about 437. inclusive of Figure 250 (SEQ ID NO:355). or (b) the complemem of the DNA molecule of (a), and. 
if the DNA molecule has at least about an 80 % sequence idemity. prefcreably at least about an 85% sequence 
identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence 
identity to (a) or (b). isolating the test DNA molecule. 
15 In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 

a PROl 181 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine or is 
complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively idemified as 
extending from about amino acid position I to about amino acid position 15 in the sequence of Figure 250 (SEQ 
ID NO:355). The transmembrane domain is at amino acids positions 243-260 of Figure 250. 

In another aspect, the invemion concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably ai least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 1 or about 16 to about 437. inclusive of Figure 250 (SEQ ID NO:355), or (b) 
the complement of the DNA of (a). 
25 Another embodiment is directed to fragments of a PROl 181 polypeptide coding sequence that may find 

use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 
preferably from about 20 to about 60 nucleoudes in length, more preferably from about 20 to about 50 
nucleotides in lengUi and most preferably from about 20 to about 40 nucleotides in length and may be derived 
from the nucleotide sequence shown in Figure 249 (SEQ ID NO:354). 
30 In another embodimem, the invemion provides isolated PROl 181 polypeptide encoded by any of the 

isolated nucleic acid sequences hereinabove identified. 

In a specific aspect, the invention provides isolated native sequence PROl 181 polypeptide, which in 
certain embodimeras. includes an amino acid sequence comprising residues 1 or about 16 to abom 437 of Figure 
250 (SEQ ID NO:355). 

35 In another aspect, the invemion concerns an isolated PROl 181 polypeptide, comprising an amino acid 

sequence having at least about 80% sequence identity, preferably at least about 85% sequence idemiiy. more 
preferably at least about 90% sequence idemity. most preferably at least about 95% sequence idemiiy to the 
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sequence of amino acid residues 1 or about 16 to about 437, inclusive of Figure 250 (SEQ ID NO:3S5). 

In a further aspect, the invention concerns an isolated PROl 181 polypeptide, comprising an amiiso acid 
sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence 
of residues 1 or about 16 to about 437, inclusive of Figure 250 (SEQ ID NO:355). 
5 In yet another asj^ct, the invention concerns an isolated PRO 1181 polypeptide , comprising the sequence 

of amino acid residues 1 or about 16 to about 437, inclusive of Figure 250 (SEQ ID NO:355), or a fragment 
thereof sufficient to provide a binding site for an anti-PROl 181 antibody. Preferably, the PROl 181 fragment 
retains a qualitative biological activity of a native PROl 181 polypeptide. 

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 

10 molecule under stringent conditions with (a) a DNA molecule encoding a PROl 181 polypeptide having the 
sequence of amino acid residues from about 1 or about 16 to about 437, inclusive of Figure 250 (SEQ ID 
NO:355), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about 
an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 
90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii)culmring a host 

15 cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) 
recovering the polypeptide from the cell culture. 

108. PROl 182 

A cDNA clone (DNA59848-1512) has been idemified, having homology to nucleic acid encoding 
20 conglutinin that encodes a novel polypeptide, designated in the present application as "PRGllSZ". 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PROl 182 polypeptide. 

In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 

25 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO 1 1 82 polypeptide having 
the sequence of amino acid residues from about 1 or about 26 to about 271. inclusive of Figure 252 (SEQ ID 
NO:357), or (b) the complement of the DNA molecule of (a). 

In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROl 182 
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 67 

30 or about 142 and about 879, inclusive, of Figure 251 (SEQ ID NO:356). Preferably, hybridization occurs under 
stringent hybridization and wash conditions. 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 
at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 

35 encoding the same manire polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203088 
(DNA59848-1512) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the 
nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDN A in 
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ATCC Dcposii No. 203088 (DNA59848-1512). 

]n still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least atK)ut 80% sequence identity, preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
identity to the sequence of amino acid residues 1 or about 26 to about 271, inclusive of Figure 252 (SEQ ID 
5 NO:357), or (b) the complement of the DNA of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 10 
nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA 
molecule encoding a PROl 182 polypeptide having the sequence of amino acid residues from 1 or about 26 to 
about 271, inclusive of Figure 252 (SEQ ID NO:357), or (b) the complement of the DNA molecule of (a), and, 
10 if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence 
identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence 
identity to (a) or (b), isolating the test DNA molecule. 

In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PROl 182 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine or is 
15 complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as 
extending from about amino acid position 1 to about amino acid position 25 in the sequence of Figure 252 (SEQ 
ID NO:357). 

In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 

encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
20 preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 

amino acid sequence of residues 1 or about 26 to about 271, inclusive of Figure 252 (SEQ ID NO:357), or (b) 

the complement of the DNA of (a). 

Another embodiment is directed to fragments of a PROl 182 polypeptide coding sequence that may fmd 

use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 
25 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 

nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived 

from the nucleotide sequence shown in Figure 251 (SEQ ID NO: 356). 

In another embodiment, the invention provides isolated PROl 182 polypeptide encoded by any of the 

isolated nucleic acid sequences hereinabove identified. 
30 In a specific aspect, the invention provides isolated native sequence PROl 182 polypeptide, which in 

cenain embodiments, includes an amino acid sequence comprising residues 1 or about 26 to about 271 of Figure 

252 (SEQ ID NO:357). 

In another aspect, the invention concerns an isolated PROl 182 polypeptide, comprising an amino acid 
sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 
35 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
sequence of amino acid residues 1 or about 26 to about 271, inclusive of Figure 252 (SEQ ID NO:357). 



225 



BNSOOCID: <WO_99630e8A2J.> 



10 



PCr/US99/12252 

In a further aspeci. the inveniion concern an isolated PROl 182 polypeptide, comprisii^ an amino acid 
sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95% positives when compared with the amiiwacid sequence 
of residues 1 or about 26 to about 271, inclusive of Figure 252 (SEQ ID NO:357), 

In yet another aspect, the invention concerns an isolated PRO 1 1 82 polypeptide, comprising the sequence 
of amino acid residues 1 or about 26 to about 271, inclusive of Figure 252 (SEQ ID NO:357), or a fragment 
thereof sufficient to provide a binding site for an anti-PROl 182 antibody. Preferably, the PR01182 fragment 
retains a qualitative biological activity of a native PROl 182 polypeptide. 

In a still funher aspeci, the invention provides a polypeptide produced by (i) hybridizing a test DNA 
molecule under siringem conditions with (a) a DNA molecule encoding a PROl 182 polypeptide having the 
sequence of amino acid residues from about 1 or about 26 to about 271, inclusive of Figure 252 (SEQ ID 
NO:357), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about 
an 80% sequence idemity. preferably at least about an 85% sequence identity, more preferably at least about a 
90% sequence idemity, most preferably at least about a 95% sequence idemity to (a) or (b), (ii) culniring a host 
cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) 
15 recovering the polypeptide from the cell culmre. 

In yet another embodiment, the invention concerns agonists and antagonists of a native PROl 182 
polypeptide. In a panicular embodiment, the agonist or antagonist is an anii-PRQl 182 antibody. 

In a funher embodiment, the invention concerns a method of identifying agonists or antagonists of a 
native PROl 182 polypeptide by contacting the native PRO 11 82 polypeptide with a candidate molecule and 
20 monitoring a biological activity mediated by said polypeptide. 

In a still further embodiment, the invention concerns a composition comprising a PROl 182 polypeptide, 
or an agonist or antagonist as hereinabove defmed, in combination with a pharmaceutical ly acceptable carrier. 

109. PROl 155 

25 A cDNA clone (DNA59849-I504) has been identified, having sequence identity with neurokinin B dial 

encodes a novel polypeptide, designated in the present application as "PROl 155. " 

In one embodiment, the inveniion provides an isolated nucleic acid molecule comprising DNA encoding 
a PROl 155 polypeptide. 

In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 
30 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95 % sequence idemity to (a) a DNA molecule encoding a PROl 155 polypeptide having 
the sequence of amino acid residues from about I or 19 to about 135, inclusive of Figure 254 (SEQ ID NO:359), 
or (b) the complement of the DNA molecule of (a). The term "or*' as used herein to refer to nucleic or amino 
acids is meant to convey alternative embodiments, i.e., M35 or alternatively in another embodiment, 19-135. 
35 In another aspeci, the invention concerns an isolated nucleic acid molecule encoding a PROl 155 

polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 158 or 
212 and about 562, inclusive, of Figure 253 (SEQ ID NO:358). Preferably, hybridization occurs under stringem 
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hybridizaiion and wash conditions. 

In a funher aspect, the invemion concerns an isolated nucleic acid molecule comprising DNA having 

at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 

about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 

encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 209986 
5 (DNA59849-1504), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 

acid comprises a DNA encoding the same mature polypeptide encoded by the hmnan protein cDNA in ATCC 

Deposit No. 209986 (DNA59849-I504). 

In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 

encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
10 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 

identity to the sequence of amino acid residues from about 1 or 19 to about 135. inclusive of Figure 254 (SEQ 

ID NO:359), or the complement of the DNA of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule produced by hybridizing 

a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1155 polypeptide 
15 having the sequence of amino acid residues from about 19 to about 135. inclusive of Figure 254 (SEQ ID 

NO:359). or (b) the complement of the DNA molecule of (a), and. if the DNA molecule has at least about an 

80 % sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% 

sequence identity, most preferably at least about a 95% sequence identity to (a) or (b). isolating the test DNA 

molecule. 

20 In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 

encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 1 or 19 to about 135. inclusive of Figure 254 (SEQ ID NO:359). or (b) the 
complement of the DNA of (a). 

25 In another embodiment, the invention provides isolated PROl 155 polypeptide encoded by any of the 

isolated nucleic acid sequences hereinabove defmed. 

In a specific aspect, the invention provides isolated native sequence PROl 155 polypeptide, which in one 
embodiment, includes an amino acid sequence comprising residues 1 or 19 through 135 of Figure 254 (SEQ ID 
NO:359). 

30 In another aspect, the invention concerns an isolated PROl 155 polypeptide, comprising an amino acid 

sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
sequence of amino acid residues 1 or 19 to about 135. inclusive of Figure 254 (SEQ ID NO:359). 

In a further aspect, the invention concerns an isolated PROl 155 polypeptide, comprising an amino acid 

35 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95% posuives when compared with the amino acid sequence 
of residues 1 or 19 through 135 of Figure 254 (SEQ ID NO:359). 
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in a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 
molecule under stringent conditions with (a) a DNA molecule encoding a PR01155 polypeptide having the 
sequence of amino acid residues from about 1 or 19 to about 135, inclusive of Figure 254 (SEQ ID NO:359), 
or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% 
sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% 
5 sequence identity, most preferably at least about a 95% sequence identity to (a) or (b). (ii) culturing a host cell 
comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) 
recovering the polypeptide from the cell culture. 

In yei another embodiment, the invention concerns agonists and antagonists of the a native PROl 155 
polypeptide. In a panicular embodiment, the agonist or antagonist is an anti-PROl 155 antibody. 
10 In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 

native PR01155 polypeptide, by contacting die native PROl 155 polypeptide with a candidate molecule and 
monitoring a biological activity mediated by said polypeptide. 

In a still further embodiment, the invention concerns a composition comprising a PROl 155 polypeptide, 
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceutically acceptable carrier. 

15 

110. PROl 156 

A cDNA clone (DNA59853-1505) has been identiHcd that encodes a novel secreted polypeptide, 
designated in the present application as "PR01156." 

In one embodiment, die invention provides an isolated nucleic acid molecule comprising DNA encoding 

20 a PROl 156 polypeptide. 

In one aspect, the isolated nucleic acid comprises DNA having ai least about 80% sequence identity, 
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PROl 156 polypeptide having 
the sequence of amino acid residues from about 23 to about 159, inclusive of Figure 256 (SEQ ID NO:361), or 

25 (b) the complement of the DNA molecule of (a). 

In anodier aspect, die invention concerns an isolated nucleic acid molecule encoding a PROl 156 
polypeptide comprising DNA hybridizing to die complement of die nucleic acid between about residues 281 and 
about 688, inclusive, of Figure 255 (SEQ ID NO:360). Preferably, hybridization occurs under stringent 
hybridization and wash conditions. 

30 In a further aspect, die invention concerns an isolated nucleic acid molecule comprising DNA having 

at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding die same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 209985 
(DNA59853-I505). or (b) die complement of die DNA molecule of (a), in a preferred embodimem, the nucleic 

35 acid comprises a DNA encoding die same mature polypeptide encoded by die human protein cDNA in ATCC 
Deposit No, 209985 (DNA59853-1505). 
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In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
identity to the sequence of amino acid residues from about 23 to about 159, inclusive of Figure 256 (SEQ ID 
NO:361), or the complement of the DNA of (a). 
5 In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 50 

nucleotides, preferably at least 100 nucleotides, and produced by hybridizing a test DNA molecule under 
stringent conditions with (a) a DNA molecule encoding a PROl 156 polypeptide having the sequence of amino 
acid residues from about 23 to about 159. inclusive of Figure 256 (SEQ ID NO:361), or (b) the complemem of 
the DNA molecule of (a), and, if the DNA molecule has at least about an 80 % sequence identity, preferably 

10 at least about an 85% sequence identity, more preferably ai least about a 90% sequence identity, most preferably 
at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule. 

In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PROl 156 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is 
complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as 

15 extending from amino acid position 1 to about amino acid position 22 in the sequence of Figure 256 (SEQ ID 
NO:361). 

In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably ai least about 85% positives, more 
preferably at least about 90% positives, most preferably ai least about 95% positives when compared with the 
amino acid sequence of residues 23 to about 159, inclusive of Figure 256 (SEQ ID NO:361). or (b) the 
complement of the DNA of (a). 

In another aspect, the invention concerns hybridization probes thai comprise fragments of the PR0784 
coding sequence, or complementary sequence thereof. The hybridization probes preferably have at least about 
20 nucleotides to about 80 nucleotides, and more preferably, at least about 20 to about 50 nucleotides. 
25 In another embodiment, the invention provides isolated PROl 156 polypeptide encoded by any of the 

isolated nucleic acid sequences hereinabove defined. 

In a specific aspect, the invention provides isolated native sequence PROl 156 polypeptide, which m one 
embodiment, includes an amino acid sequence comprising residues 23 to 159 of Figure 256 (SEQ ID NO:361). 
In another aspect, the invention concerns an isolated PROl 156 polypeptide, comprising an amino acid 
30 sequence having at least about 80% sequence idemity. preferably at least about 85% sequence identity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
sequence of amino acid residues 23 to about 159, inclusive of Figure 256 (SEQ ID NO:361). 

In a funher aspect, the invemion concerns an isolated PROl 156 polypeptide, comprising an amino acid 
sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
35 about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence 
of residues 23 to 159 of Figure 256 (SEQ ID NO:361). 
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In yet another aspect, the invention concerns an isolated PRO 1 1 56 polypeptide , comprising the sequence 
of amino acid residues 23 to about 159. inclusive of Figure 256 (SEQ ID NO:361), or a fragment thereof 
sufficient to provide a binding site for an ami-PROl 156 antibody. Preferably, the PROl 156 fragment retains 
a qualitative biological activity of a native PROl 156 polypeptide. 

In a still further asj^ct, the inveniion provides a polypeptide produced by (i) hybridizing a test DNA 
5 molecule under su-ingeni conditions with (a) a DNA molecule encoding a PROl 156 poiypepiide having the 
sequence of amino acid residues from about 23 to about 159, inclusive of Figure 256 (SEQ ID N0:361 ), or (b) 
the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 
identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 
identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising 
10 the test DNA molecule under conditions suitable for expression of the polyi>eptide, and (iii) recovering the 
polypeptide from the cell culture. 

111. PRO1098 

A cDNA clone (DNA59854-1459) has been identified which encodes a novel polypeptide, designated 
1 5 in the present application as "PRO1098. " 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PRO 1098 polypeptide. 

In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 
preferably at least about 85% sequence identity, more preferably ai least about 90% sequence identity, most 
20 preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO 1098 polypeptide having 
the sequence of amino acid residues from about 1 or 20 to about 78, inclusive of Figure 258 (SEQ ID NO:363), 
or (b) the complement of the DNA molecule of (a). The term "or'' as used herein to refer to amino or nucleic 
acids is meant to refer to two alternative embodiments provided herein, i.e., 1-78, or in another embodiment, 
20-78. 

25 In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1098 

polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 58 or 
1 15 and about 291 , inclusive, of Figure 257 (SEQ ID NO:362). Preferably, hybridization occurs under stringent 
hybridization and wash conditions. 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 

30 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 209974 
(DNA59854-i459), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 
acid comprises a DNA encoding the same mamre polypeptide encoded by the human protein cDN A in ATCC 

35 Deposit No. 209974 (DNA59854-1459). 

In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
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identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
identity to the sequence of amino acid residues from about 1 or 20 to about 78, inclusive of Figure 258 (SEQ 
ID NO:363). or the complement of the DNA of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule produced by hybridizing 
a test DNA molecule under siringem conditions with (a) a DNA molecule encoding a PRO1098 polypeptide 
5 having the sequence of amino acid residues from about 1 or 20 to about 78, inclusive of Figure 258 (SEQ ID 
NO:363). or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 
80 % sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90 % 
sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA 
molecule. 

10 In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 

encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues I or 20 to about 78, inclusive of Figure 258 (SEQ ID NO:363), or (b) the 
complement of the DNA of (a). 

15 In another embodiment, the invention provides isolated PRO1098 polypeptide encoded by any of the 

isolated nucleic acid sequences hereinabove defined. 

In a specific aspect, the invention provides isolated native sequence PRO 1098 polypeptide, which in one 
embodiment, includes an amino acid sequence comprising residues 1 or 20 through 78 of Figure 258 (SEQ ID 
NO:363). 

20 In another aspect, the invention concerns an isolated PRO 1098 polypeptide, comprising an amino acid 

sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
sequence of amino acid residues I or 20 to about 78, inclusive of Figure 258 (SEQ ID NO:363). 

In a further aspect, the invention concerns an isolated PRO1098 polypeptide, comprising an amino acid 

25 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence 
of residues 1 or 20 through 78 of Figure 258 (SEQ ID NO:363). 

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 
molecule under stringent conditions with (a) a DNA molecule encoding a PRO1098 polypeptide having the 

30 sequence of amino acid residues from about 1 or 20 to about 78, inclusive of Figure 258 (SEQ ID NO:363), 
or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% 
sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% 
sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell 
comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) 

35 recovering the polypeptide from the cell culture. 
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112. PR01127 

A cDNA clone (DNA60283-1484) has been identified ihat encodes a novel secreted polypeptide, 
designated in the present application as "PR01127." 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DN A encoding 
a PRO 1 127 polypeptide. 

5 In one aspect, the isolated nucleic acid comprises DNA having ai least about 80% sequence identity, 

preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO! 127 polypeptide having 
the sequence of amino acid residues from about 1 or 30 to about 67, inclusive of Figure 260 (SEQ ID NO:365), 
or (b) the complement of the DNA molecule of (a). The term ''or'* in reference to amino or nucleic acids as used 

10 herein refers to two alternative embodiments, i.e., 1-67 in one embodiment, or alternatively, 30-67. 

In another aspect, die invention concerns an isolated nucleic acid molecule encoding a PRO 1127 
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 126 or 
213 and about 326, inclusive, of Figure 259 (SEQ ID NO:364). Preferably, hybridization occurs under stringent 
hybridization and wash conditions. 

15 In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 

at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203043 
(DNA60283-1484), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 

20 acid comprises a DNA encoding the same mawre polypeptide encoded by the human protein cDNA in ATCC 
Deposit No. 203043 (DNA60283-1484). 

In a still fimhcr aspect, the invention concerns an isolated nucleic acid molecule comprising <a) DNA 
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 

25 identity to the sequence of amino acid residues from about 1 or 30 to about 67, inclusive of Figure 260 (SEQ 
ID NO:365), or the complement of the DNA of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule produced by hybridizing 
a test DNA molecule imder struigent conditions widi (a) a DNA molecule encoding a PR01127 polypeptide 
having the sequence of amino acid residues from about 1 or 30 to about 67, inclusive of Figure 260 (SEQ ID 

30 NO:365), or (b) the complement of the DNA molecule of (a), and. if the DNA molecule has at least about an 
80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% 
sequence identity, most preferably at least about a 95% sequence identity to (a) or (b). isolating the test DNA 
molecule. 

In a specific aspect, die invention provides an isolated nucleic acid molecule comprising DNA encoding 
35 a PRO 1127 polypeptide without the N-terminal signal sequence and/or the initiating methionine. The signal 
peptide has been tentatively identified as extending from amino acid position 1 through about amino acid position 
29 in the sequence of Figure 260 (SEQ ID NO:365). 
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In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 1 or 30 to about 67, inclusive of Figure 260 (SEQ ID NO:365). or (b) the 
complement of the DNA of (a). 
5 Another embodiment is directed to fragments of a PRO 1 127 polypeptide coding sequence that may find 

use as hybridization probes. Such nucleic acid fragments are from about 20 through about m nucleotides in 
length, preferably from about 20 through about 60 nucleotides in length, more preferably from about 20 through 
about 50 nucleotides in length, and most preferably from about 20 through about 40 nucleotides in length. 

In another embodiment, the invention provides isolated PRO 1 127 polypeptide encoded by any of the 
10 isolated nucleic acid sequences hereinabove defined. 

In a specific aspect, the invention provides isolated native sequence PRO 1 127 polypeptide, which in one 
embodiment, includes an amino acid sequence comprising residues 1 or 30 through 67 of Figure 260 (SEQ ID 
NO:365). 

In another aspect, the invention concerns an isolated PROl 127 polypeptide, comprising an amino acid 
15 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 

preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 

sequence of amino acid residues 1 or 30 to about 67, inclusive of Figure 260 (SEQ ID NO:365). 

In a further aspect, the invention concerns an isolated PRO 1 127 polypeptide, comprising an amino acid 

sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
20 about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence 

of residues 1 or 30 through 67 of Figure 260 (SEQ ID NO:365). 

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 

molecule under stringent conditions with (a) a DNA molecule encoding a PROl 127 polypeptide having the 

sequence of amino acid residues from about I or 30 to about 67, inclusive of Figure 260 (SEQ ID NO:365), or 
25 (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% 

sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% 

sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culiuring a host cell 

comprising the lest DNA molecule under conditions suitable for expression of the polypeptide, and (iii) 

recovering the polypeptide from the cell culmre. 
30 In yet another embodiment, the invention concerns agonists and antagonists of the a native PROl 127 

polypeptide. In a particular embodiment, the agonist or antagonist is an anii-PROl 127 antibody. 

In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 

native PROl 127 polypeptide, by contacting the native PROl 127 polypeptide with a candidate molecule and 

monitoring a biological activity mediated by said polypeptide. 
35 In a still further embodiment, the invention concerns a composition comprising a PROl 127 polypeptide, 

or an agonist or antagonist as hereinabove defmed, in combination with a pharmaceuiically acceptable carrier. 
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113. PR01126 

A cDNA clone (DNA«)615-1483) has been identified, having homology to nucleic acid encoding 
olfactomedin that encodes a novel polypeptide, designated in the present application as "PROlllG**. 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DN A encoding 
a PROl 126 polypeptide. 

5 In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 

preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PROl 126 polypeptide having 
the sequence of amino acid residues from about 1 or about 26 to about 402, inclusive of Figure 262 (SEQ ID 
NO:367), or (b) the complement of the DNA molecule of (a). 

10 In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROl 126 

polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 1 10 
or about 185 and about 1315, inclusive, of Figure 261 (SEQ ID NO:366). Preferably, hybridization occurs 
imder stringent hybridization and wash conditions. 

In a funher aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 

15 at least about 80% sequence identity, preferably ai least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 209980 
(DNA60615-1483) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the 
nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in 

20 ATCC Deposit No. 209980 (DNA60615-1483). 

In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
identity to the sequence of amino acid residues 1 or about 26 to about 402, inclusive of Figure 262 (SEQ ID 

25 NO:367), or (b) the complement of the DNA of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 10 
nucleotides and produced by hybridizing a test DNA molecule imder suingent conditions with (a) a DNA 
molecule encoding a PROl 126 polypeptide having the sequence of amino acid residues from 1 or about 26 to 
about 402, inclusive of Figure 262 (SEQ ID NO:367), or (b) the complement of the DNA molecule of (a), and, 

30 if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence 
identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence 
identity to (a) or (b), isolating the test DNA molecule. 

In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PROl 126 polypeptide, with or without the N-ierminal signal sequence and/or the initiating methionine, or is 

35 complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as 
extending from about amino acid position 1 to about amino acid position 25 in the sequence of Figure 262 (SEQ 
ID NO:367). 
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In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 1 or about 26 to about 402, inclusive of Figure 262 (SEQ ID NO:367), or (b) 
the complement of the DNA of (a). 
5 Another embodiment is directed to fragments of a PRO] 126 polypeptide coding sequence that may find 

use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 
preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 
nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived 
from the nucleotide sequence shown in Figure 261 (SEQ ID NO:366). 
10 In another embodiment, the invention provides isolated PRO 1 126 polypeptide encoded by any of the 

isolated nucleic acid sequences hereinabove identified. 

In a specific aspect, the invention provides isolated native sequence PRO 11 26 polypeptide, which in 
certain embodiments, includes an amino acid sequence comprising residues 1 or about 26 to about 402 of Figure 
262 (SEQ ID NO:367). 

15 In another aspect, the invention concerns an isolated PRO 1 126 polypeptide, comprising an amino acid 

sequence having a! least about 80% sequence identity, preferably at least about 85% sequence identity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
sequence of amino acid residues 1 or about 26 to about 402, inclusive of Figure 262 (SEQ ID NO:367). 

In a further aspect, the invention concerns an isolated PRO 1 126 polypeptide, comprising an amino acid 

20 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence 
of residues 1 or about 26 to about 402, inclusive of Figure 262 (SEQ ID NO:367). 

In yet another aspect, the invention concerns an isolated PRO 1 126 polypeptide, comprising the sequence 
of amino acid residues 1 or about 26 to about 402, inclusive of Figure 262 (SEQ ID NO:367), or a fragment 

25 thereof sufficient to provide a binding site for an anti-PR01126 antibody. Preferably, the PROl 126 fragment 
retains a qualitative biological activity of a native PROl 126 polypeptide. 

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 
molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1126 polypeptide having the 
sequence of amino acid residues from about 1 or about 26 to about 402, inclusive of Figure 262 (SEQ ID 

30 NO:367), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about 
an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 
90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host 
cell comprising the lest DNA molecule under conditions suitable for expression of the polypeptide, and (iii) 
recovering the polypeptide from the cell culmre. 

35 In yet another embodiment, the invention concerns agonists and antagonists of a native PROl 126 

polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PROl 126 antibody. 
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In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 
native PRO 11 26 polypeptide by contacting the native PRO 11 26 polypeptide with a candidate molecule and 
monitoring a biological activity m^iaied by said polypeptide. 

In a still further embodiment, the invention concerns a composition comprising a PROl 126 polypeptide, 
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceutically acceptable carrier. 

5 

114. PROl 125 

A cDNA clone (DNA606I9-1482) has been identified, having beta-transducin family Trp- Asp (WD) 
conserved regions, that encodes a novel polypeptide, designated in the present application as "PROl 125." 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DN A encoding 
10 a PROl 125 polypeptide. 

In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PROl 125 polypeptide having 
the sequence of amino acid residues from about 1 or 26 to about 447, inclusive of Figure 264 (SEQ ID NO:369), 
15 or (b) the complement of the DNA molecule of (a). As used herein, **or" when referring to nucleic acids or 
amino acids, refers to two alternative embodiments, i.e., 1-447 and 26-447. 

In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROl 125 
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 47 or 
122 and about 1387, inclusive, of Figure 263 (SEQ ID NO:368). Preferably, hybridization occurs under 
20 stringent hybridization and wash conditions. 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 
at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 209993 
25 (DNA60619- 1482), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 
acid comprises a DNA encoding the same mamre polypeptide encoded by the human protein cDNA in ATCC 
Deposit No. 209993 (DNA60619-1482). 

In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
30 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
identity to the sequence of amino acid residues from about 1 or 26 to about 447, inclusive of Figure 264 (SEQ 
ID NO:369), or the complement of the DNA of (a). 

in a further aspect, the invention concerns an isolated nucleic acid molecule produced by hybridizing 
a test DNA molecule under stringent conditions with (a) a DNA molecule encoding a PROl 125 polypeptide 
35 having the sequence of amino acid residues from about 1 or 26 to about 447, inclusive of Figure 264 (SEQ ID 
NO:369), or (b) the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 
80 % sequence identity, preferably at least about an 85 % sequence identity, more preferably at least about a 90% 
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sequence identity, nu>st preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA 
molecule. 

In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PRO 1125 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionuie, and 
its soluble, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding 
5 nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 
1 through about amino acid position 25 in the sequence of Figure 264 (SEQ ID NO:369). 

In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
10 amino acid sequence of residues 1 or 26 to about 447, inclusive of Figure 264 (SEQ ID NO:369), or (b) the 
complement of the DNA of (a). 

In another embodiment, the invention provides isolated PROl 125 polypeptide encoded by any of the 
isolated nucleic acid sequences hereinabove defined. 

In a specific aspect, the invention provides isolated native sequence PROl 125 polypeptide, which in one 
15 embodiment, includes an amino acid sequence comprising residues I or 26 to 447 of Figure 264 (SEQ ID 
NO:369). 

In another aspect, the invention concerns an isolated PROl 125 polypeptide, comprising an amino acid 
sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 

20 sequence of amino acid residues 1 or 26 to about 447, inclusive of Figure 264 (SEQ ID NO:369). 

In a further aspect, the invention concerns an isolated PROl 125 polypeptide, comprising an amino acid 
sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence 
of residues 1 or 26 through 447 of Figure 264 (SEQ ID NO:369). 

25 In yet another aspect, the invention concerns an isolated PRO 1 1 25 polypeptide, comprising the sequence 

of amino acid residues 26 to about 447, inclusive of Figure 264 (SEQ ID NO:369), or a fragment thereof 
sufficient to provide a binding site for an anti-PROl 125 antibody. Preferably, the PROl 125 fragment reuins 
a qualiutive biological activity of a native PRO 1 125 polypeptide. 

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 

30 molecule under stringent conditions with (a) a DNA molecule encoding a PR01125 polypeptide having the 
sequence of amino acid residues from about 26 to about 447, inclusive of Figure 264 (SEQ ID NO:369), or (b) 
the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 
identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 
identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising 

35 the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the 
polypeptide from the cell culture. 
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In yet another emt)odiment, the invention concerns agonists and antagonists of the a native PROi 125 
polypeptide. In a panicular embodiment, the agonist or antagonist is an anti-PROl 125 antibody. 

In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 
native PROI 125 polypeptide, by contacting the native PROI 125 polypeptide with a candidate molecule and 
monitoring a biological activity mediated by said polypeptide. 

5 

115. PROI 186 

A cDN A clone (DNA60621-1516) has been identified that encodes a novel polypeptide having sequence 
identity with venom protein A and designated in the present application as "PROllSd.** 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DN A encoding 
10 a PRO! 186 polypeptide. 

in one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO 1 1 86 polypeptide having 
the sc^quence of amino acid residues from about 20 to about 105, iiKlusive of Figure 266 (SEQ ID NO:371), or 
15 (b) the complement of the DNA molecule of (a). 

In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROI 186 
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 148 and 
about 405. inclusive, of Figure 265 (SEQ ID NO:370). Preferably, hybridization occurs under stringent 
hybridization and wash conditions. 
20 In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 

at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposu No. 203091 
(DNA60621-1516), or (b) the complement of die DNA molecule of (a). In a preferred embodiment, the nucleic 
25 acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDN A in ATCC 
Deposrt No. 203091 (DNA60621.1516). 

In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
30 identity to the sequence of amino acid residues from about 20 to about 105, inclusive of Figure 266 (SEQ ID 
NO:371), or the complement of the DNA of (a). 

In a funher aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule 
tmder stringent conditions with (a) a DNA molecule encoding a PROI 186 polypeptide having the sequence of 
35 amino acid residues from about 20 to about 105, inclusive of Figure 266 (SEQ ID NO:37l), or (b) the 
complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, 
preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most 
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preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule. 

In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 20 to about 105. inclusive of Figure 266 (SEQ ID NO:371). or (b) the 
5 complement of the DNA of (a). 

Another embodiment is directed to fragments of a PR01186 polypeptide coding sequence that may find 
use as hybridization probes. Such nucleic acid fragments are from about 20 through about 80 nucleotides in 
length, preferably from about 20 through about 60 nucleotides in length, more preferably from about 20 through 
about 50 nucleotides in length, and most preferably from about 20 through about 40 nucleotides in length. 
10 In another embodiment, the invention provides isolated PROl 186 polypeptide encoded by any of the 

isolated nucleic acid sequences hereinabove defmed. 

In a specific aspect, the invention provides isolated native sequence PROl 186 polypeptide, which in one 
embodiment, includes an amino acid sequence comprising residues 20 through 105 of Figure 266 (SEQ ID 
NO:371). 

15 In another aspect, the invention concerns an isolated PROl 186 polypeptide, comprising an amino acid 

sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
sequence of amino acid residues 20 to about 105, inclusive of Figure 266 (SEQ ID NO:37l). 

In a further aspect, the invention concerns an isolated PROl 186 polypeptide, comprising an amino acid 

20 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence 
of residues 20 through 105 of Figure 266 (SEQ ID NO:371). 

In yet another aspect, the invention concerns an isolated PRO 1 1 86 polypeptide, comprising the sequence 
of amino acid residues 20 to about 105, inclusive of Figure 266 (SEQ ID NO:371), or a fragment thereof 

25 sufficient to provide a binding site for an anti-PROI 186 antibody. Preferably, the PROl 186 fragment retains 
a qualitative biological activity of a native PROl 186 polypeptide. 

In a still funher aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 
molecule under stringent conditions with (a) a DNA molecule encoding a PROl 186 polypeptide having the 
sequence of amino acid residues from about 20 to about 105, inclusive of Figure 266 (SEQ ID NO:371), or (b) 

30 the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 
identity, preferably at least about an 85% sequence identity, more preferably ai least about a 90% sequence 
identity, most preferably at least about a 95% sequence identity to (a) or (b). (ii) culturing a host cell comprising 
the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the 
polypeptide from the cell culmre. 

35 In yet another embodiment, the invention concerns agonists and antagonists of the a native PROl 186 

polypeptide. In a panicular embodiment, the agonist or antagonist is an anti-PROl 186 antibody. 
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In a funher embodiment, the invention concerns a method of identifying agonists or antagonists of a 
native PR01186 polypeptide, by contacting the native PR01186 polypeptide with a candidate molecule and 
monitoring a biological activity mediated by said polypeptide. 

In a still further embodiment, the invention concerns a composition comprising a PRO 1 186 polypeptide, 
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuiically acceptable carrier. 

5 

116. PR01198 

A cDNA clone (DNA60622-I525) has been identified that encodes a novel secreted polypeptide 
designated in the present plication as ''PR01198." 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
10 a PROl 198 polypeptide. 

In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PROl 198 polypeptide having 
the sequence of amino acid residues from about 35 to about 229. inclusive of Figure 268 (SEQ ID NO:373), or 
15 (b) the complement of the DNA molecule of (a). 

In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROl 198 
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 156 and 
about 740. inclusive, of Figure 268 (SEQ ID NO:373). Preferably, hybridization occurs under stringent 
hybridization and wash conditions. 
20 In a furdier aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 

at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203090 
(DNA60622-1525), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 
25 acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDN A in ATCC 
Deposit No. 203090 (DNA60622-1525). 

In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
30 identity to the sequence of amino acid residues from about 35 to about 229. inclusive of Figure 268 (SEQ ID 
NO:373), or the complement of the DNA of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule 
under stringent conditions with (a) a DNA molecule encoding a PROl 198 polyf>eptide having the sequence of 
35 amino acid residues from about 35 to about 229, inclusive of Figure 268 (SEQ ID NO:373), or (b) the 
complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, 
preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most 
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preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule. 

In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PROl 198 polypeptide, with or without the N-terminal signal sequence and/or the initialing methionine, or is 
complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as 
extending from about amino acid position 1 through about amino acid position 35 in the sequence of Figure 268 
5 (SEQ ID NO:373). 

In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 35 to about 229, inclusive of Figure 268 (SEQ ID NO:373). or (b) the 
10 complement of die DNA of (a). 

Another embodiment is directed to fragments of a PROl 1 98 polypeptide coding sequence diat may find 
use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 
preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 
nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length. 
15 In anodier embodiment, the invention provides isolated PROl 198 polypeptide encoded by any of the 

isolated nucleic acid sequences hereinabove defined. 

In a specific aspect, the invention provides isolated native sequence PROl 198 polypeptide, which in one 
embodiment, includes an amino acid sequence comprising residues 35 to 229 of Figure 268 (SEQ ID NO:373). 
In another aspect, the invention concerns an isolated PROl 198 polypeptide, comprising an amino acid 
20 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
sequence of amino acid residues 35 to about 229, inclusive of Figure 268 (SEQ ID NO:373). 

In a further aspect, the invention concerns an isolated PROl 198 polypeptide, comprising an amino acid 
sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
25 about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence 
of residues 35 to 229 of Figure 268 (SEQ ID NO:373). 

In yet another aspect, the invention concerns an isolated PROl 198 polypeptide, comprising the sequence 
of amino acid residues 35 to about 229, inclusive of Figure 268 (SEQ ID NO:373), or a fragment thereof 
sufficient to provide a binding site for an anti-PROl 198 antibody. Preferably, the PROl 198 fragment retains 
30 a qualitative biological activity of a native PROl 198 polypeptide. 

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 
molecule under stringent conditions with (a) a DNA molecule encoding a PROl 198 polypeptide having the 
sequence of amino acid residues from about 35 to about 229, inclusive of Figure 268 (SEQ ID NO:373). or (b) 
the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 
35 identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 
identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising 
the test DNA molecule imder conditions suitable for expression of the polypeptide, and (iii) recovering the 
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polypeptide from the cell culture. 
117. PRQ11S8 

A cDN A clone (DNA60625-1507) has been identified that encodes a novel transmembrane polypeptide, 
designated in the present application as "PR01158". 
5 In one embodiment, the invention provides an isolated nucleic acid molecule comprising DN A encoding 

a PRO 1158 polypeptide. 

In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PROl 158 polypeptide having 
10 the sequence of amino acid residues from about 20 to about 123, inclusive of Figure 270 (SEQ ID NO:375). or 
(b) the complement of the DNA molecule of (a). 

In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROl 158 
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 220 and 
about 531, inclusive, of Figure 269 (SEQ ID NO:374). Preferably, hybridization occurs under stringent 
15 hybridization and wash conditions. 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 
at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 209975 
20 (DNA60625-1 507), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 
acid comprises a DNA encoding the same manire polypeptide encoded by the human protein cDNA in ATCC 
Deposit No. 209975 (DNA60625-1507). 

in a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
25 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
identity to the sequence of amino acid residues from about 20 to about 123, inclusive of Figure 270 (SEQ ID 
NO:375). or the complement of the DNA of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule 
30 under stringent conditions with (a) a DNA molecule encoding a PROl 158 polypeptide having the sequence of 
amino acid residues from about 20 to about 123, inclusive of Figure 270 (SEQ ID NO:375), or (b) the 
complement of the DNA molecule of (a), and, if the DNA molecule has ai least about an 80 % sequence identity, 
preferably ai least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most 
preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule. 
35 In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 

a PROl 158 polypeptide, with or without the N-terminal signal sequence and/or the inhiating methionine, and 
its soluble, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding 
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nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid 
position 1 to about amino acid position 1 9 in the sequence of Figure 270 (SEQ ID NO:375), The transmembrane 
domain has been tentatively idemified as extending from about amino acid position 56 to about amino acid 
position 80 in the PROl 158 amino acid sequence (Figure 270, SEQ ID NO:375). 

In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 20 to about 123, inclusive of Figure 270 (SEQ ID NO:375). or (b) the 
complement of the DNA of (a). 

In another aspect, the mvention concerns hybridization probes that comprise fragments of the PROl 158 
coding sequence, or complementary sequence thereof. The hybridization probes preferably have at least about 
20 nucleotides to about 80 nucleotides, and more preferably, at least about 20 to about 50 nucleotides. 

In another embodiment, the invention provides isolated PROl 158 polypeptide encoded by any of the 
isolated nucleic acid sequences hereinabove defined. 

In a specific aspect, the invention provides isolated native sequence PROl 158 polypeptide, which in one 
15 embodiment, includes an amino acid sequence comprising residues 20 to 123 of Figure 270 (SEQ ID NO:375). 

In another aspect, the invention concerns an isolated PROl 158 polypeptide, comprising an amino acid 
sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
sequence of amino acid residues 20 to about 123, inclusive of Figure 270 (SEQ ID NO:375). 
20 In a further aspect, the invention concerns an isolated PROl 158 polypeptide, comprising an amino acid 

sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence 
of residues 20 to 123 of Figure 270 (SEQ ID NO:375). 

In yet another aspect, the invention concerns an isolated PRO 1 1 58 polypeptide, comprising the sequence 
25 of amino acid residues 20 to about 123. inclusive of Figure 270 (SEQ ID NO:375), or a fragmem thereof 
sufficiem to provide a binding site for an anti-PROl 158 amibody. Preferably, the PROl 158 fragmem retains 
a qualitative biological activity of a native PROl 158 polypeptide. 

In a still hirther aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 
molecule under stringent conditions with (a) a DNA molecule encoding a PROl 158 polypeptide having the 
30 sequence of amino acid residues from about 20 to about 123. inclusive of Figure 270 (SEQ ID NO:375), or (b) 
the complemem of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 
identity, preferably at least about an 85% sequence idemity, more preferably at least about a 90% sequence 
identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising 
the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the 
35 polypeptide from the cell culture. 
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118. PR01159 

A cDNA clone (DNA60627-1508) has been idemified that encodes a novel secreted polypeptide, 
designated in the present application as "PR01159". 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DN A encoding 
a PR01159 polypeptide. 

5 In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 

preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably ai least about 95% sequence identity to (a) a DNA molecule encoding a PROl 159 polypeptide having 
the sequence of amino acid residues from about 1 or about 16 to about 90, inclusive of Figure 272 (SEQ ID 
NO:377), or (b) the complement of the DNA molecule of (a). 

10 in another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROl 159 

polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 92 
or about 137 and about 361 . inclusive, of Figure 271 (SEQ ID NO:376). Preferably, hybridization occurs under 
stringent hybridization and wash conditions. 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 

15 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203092 
(DNA60627-I508) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the 
nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in 

20 ATCC Deposit No. 203092 {DNA60627-1508). 

In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
identity to the sequence of amino acid residues 1 or about 16 to about 90, inclusive of Figure 272 (SEQ ID 

25 NO:377), or (b) the complement of the DNA of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 10 
nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA 
molecule encoding a PROl 159 polypeptide having the sequence of amino acid residues from 1 or about 16 to 
about 90, inclusive of Figure 272 (SEQ ID NO:377), or (b) the complement of the DNA molecule of (a), and, 

30 if the DNA molecule has at least about an 80 % sequence identity, prefercably at least about an 85% sequence 
identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence 
identity to (a) or (b), isolating the lest DNA molecule. 

In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PROl 159 polypeptide, with or without the N-tcrminal signal sequence and/or the initiating methionine, or is 

35 complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as 
extending from about amino acid position 1 to about amino acid position 15 in the sequence of Figure 272 (SEQ 
ID NO:3T7). 
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In anoiher aspect, ihe mvention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least alwut 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 1 or about 16 to about 90, inclusive of Figure 272 (SEQ ID NO:377), or (b) 
the complement of the DNA of (a). 
5 Anoiher embodiment is directed to fragments of a PRO 1 1 59 polypeptide coding sequence that may find 

use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 
preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 
nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived 
from the nucleotide sequence shown in Figure 271 (SEQ ID NO:376). 
10 In another embodiment, the invention provides isolated PROl 159 polypeptide encoded by any of the 

isolated nucleic acid sequences hereinabove identified. 

In a specific aspect, the invention provides isolated native sequence PROl 159 polypeptide, which in 
cenain embodiments, includes an amino acid sequence comprising residues 1 or about 16 to about 90 of Figure 
272 (SEQ ID NO:377). 

15 In another aspect, the invention concerns an isolated PRO! 159 polypeptide, comprising an amino acid 

sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 
preferably at least about 90% sequence identity, most preferably at leasi about 95% sequence identity to the 
sequence of amino acid residues 1 or about 16 to about 90, inclusive of Figure 272 (SEQ ID NO:377). 

In a further aspect, the invention concerns an isolated PROl 159 polypeptide, comprising an amino acid 

20 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence 
of residues 1 or about 16 to about 90. inclusive of Figure 272 (SEQ ID NO:377). 

In yet another aspect, the invention concerns an isolated PROl 159 polypeptide, comprising the sequence 
of amino acid residues 1 or about 16 to about 90, inclusive of Figure 272 (SEQ ID NO:377), or a fragment 

25 thereof sufficient to provide a binding site for an anti-PROl 159 antibody. Preferably, the PROl 159 fragment 
retains a qualitative biological activity of a native PROl 159 polypeptide. 

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 
molecule under stringent conditions with (a) a DNA molecule encoding a PROl 159 polypeptide having the 
sequence of amino acid residues from about 1 or about 16 to about 90, inclusive of Figure 272 (SEQ ID 

30 NO:377), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about 
an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 
90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host 
cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) 
recovering the polypeptide from the cell culwre. 

35 
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119. PR01124 

A cDNA clone (DNA60629- 1481) has been identified, having sequence identity with a chloride channel 
proiein and lung-endoihelial cell adhesion molecule-l (EAM-1) thai encodes a novel polypeptide, designated in 
the present application as "PRO! 124." 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DN A encoding 
5 a PRO 1 1 24 polypeptide. 

In one aspect, the isolated nucleic acid comprises DNA having ai least about 80% sequence identity, 
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO] 124 polypeptide having 
the sequence of amino acid residues from about I or 22 to about 919, inclusive of Figure 274 (SEQ ID NO:379). 
10 or fli) the complement of the DNA molecule of (a): As used herein, "or", i.e., 1 or 22 and 25 or 88, is used 
to describe two alternative embodiments. For example, the invention includes amino acids 1 through 919 and 
in an alternative embodiment, provides amino acids 22 through 919, etc. 

In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01124 
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 25 or 
15 88 and about 2781, inclusive, of Figure 273 (SEQ ID NO:378). In another aspect, the invention concerns an 
isolated nucleic acid molecule hybridizing to the complement of the nucleic acid of SEQ ID NO:378. Preferably, 
hybridization occurs under stringent hybridization and wash conditions. 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 
at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
20 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human proiein cDNA in ATCC Deposit No. 209979 
(DNA60629-1481). or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 
acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 
Deposit No. 209979 (DNA60629-148I). 
25 In a still further aspect, die invention concerns an isolated nucleic acid molecule comprising (a) DNA 

encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
identity to the sequence of amino acid residues from about 1 or 22 to about 919. inclusive of Figure 274 (SEQ 
ID NO:379), or the complement of the DNA of (a). 
30 In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 

a PROl 124 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and 
its soluble, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding 
nucleic acid molecule. The cytoplasmic end can be excluded as well. The signal peptide has been tentatively 
identified as extending from amino acid position 1 to about amino acid position 21 in the sequence of Figure 274 
35 (SEQ ID NO: 379). The transmembrane domains have been tentatively identified as extending from about amino 
acid position 284 to about amino acid position 300 and from about amino acid position 617 to about amino acid 
position 633 in the amino acid sequence (Figure 274, SEQ ID NO:379). 
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In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 1 or 22 to about 919, inclusive of Figure 274 (SEQ ID NO:379). or (b) the 
complement of the DNA of (a). 
5 In another embodiment, the invention provides isolated PROl 124 polypeptide encoded by any of the 

isolated nucleic acid sequences hereinabove defmed. 

In a specific aspect, the invention provides isolated native sequence PROl 124 polypeptide, which in one 
embodiment, includes an amino acid sequence comprising residues 1 or 22 through 919 of Figure 274 (SEQ ID 
NO:379). 

10 In another aspect, the invention concerns an isolated PROl 124 polypeptide, comprising an amino acid 

sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
sequence of amino acid residues 1 or 22 to about 919. inclusive of Figure 274 (SEQ ID NO:379). 

In a further aspect, the invention concerns an isolated PROl 124 polypeptide, comprising an amino acid 
15 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence 
of residues 1 or 22 to 919 of Figure 274 (SEQ ID NO:379). 

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 
molecule under stringent conditions with (a) a DNA molecule encoding a PROl 124 polypeptide having the 
20 sequence of amino acid residues from about 1 or 22 to about 919, inclusive of Figure 274 (SEQ ID NO:379), 
or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% 
sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% 
sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culiuring a host cell 
comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) 
25 recovering the polypeptide from the cell culture. 

In yet another embodiment, the invention concerns agonists and antagonists of the a native PROl 124 
polypeptide. In a panicular embodiment, the agonist or antagonist is an anii-PROl 124 antibody. 

In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 
native PROl 124 polypeptide, by contacting the native PROl 124 polypeptide with a candidate molecule and 
30 monitoring an activity mediated by said polypeptide. 

in a still further embodiment, the invention concerns a composition comprising a PROl 124 polypeptide, 
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceutically acceptable carrier. 

120. PR01287 

35 A cDNA clone (DNA61755- 1554) has been identified, having homology to nucleic acid encoding fringe 

protein, that encodes a novel polypeptide, designated in the present application as "PR01287". 
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In one embodiment, the invention provides an isolated nucleic acid molecule comprising DN A encoding 
a PR01287 polypeptide. 

In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 
preferably at least alwut 85% sequence idemity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence ideniiiy to (a) a DNA molecule encoding a PRO 1287 polypeptide having 
5 the sequence of amino acid residues from about 1 or about 28 to about 532, inclusive of Figure 276 (SEQ ID 
NO:381), or (b) the complement of the DNA molecule of (a). 

In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1287 
polypeptide comprising DNA hybridizing to the complement of the nucleic acid betv^een about nucleotides 655 
or about 736 and about 2250, inclusive, of Figure 275 (SEQ ID NO:380). Preferably, hybridization occurs 
10 under stringent hybridization and wash conditions. 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 
at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203112 
15 (DNA61755-1554) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the 
nucleic acid comprises a DNA encoding the same manire polypeptide encoded by the human protein cDNA in 
ATCC Deposit No. 203112 (DNA6 1755- 1554). 

In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
20 identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
identity to the sequence of amino acid residues 1 or about 28 to about 532, inclusive of Figure 276 (SEQ ID 
NO:381), or (b) the complement of the DNA of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 100 
nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA 
25 molecule encoding a PR01287 polypeptide having the sequence of amino acid residues from 1 or about 28 to 
about 532. inclusive of Figure 276 (SEQ ID NO:381). or (b) the complement of the DNA molecule of (a), and, 
if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence 
identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence 
identity to (a) or (b), isolating the test DNA molecule. 
30 In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 

a PR01287 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, or is 
complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as 
extending from about amino acid position I to about amino acid position 27 in the sequence of Figure 276 (SEQ 
ID NO:381). 

35 In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 

encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared v^^iih the 
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amino acid sequence of residues 1 or about 28 to about 532, inclusive of Figure 276 (SEQ ID NO:381), or (b) 
the complement of the DNA of (a). 

Another embodiment is directed to fragments of a PRO 1287 polypeptide coding sequence that may fmd 
use as hybridization prot^s. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 
preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 
5 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived 
from the nucleotide sequence shown in Figure 275 (SEQ ID NO:380). 

In another embc^imem, the invention provides isolated PRO 1287 polypeptide encoded by any of the 
isolated nucleic acid sequences hereinabove identified. 

In a specific aspect, the invention provides isolated native sequence PRO 1287 polypeptide, which in 
10 certain embodiments, includes an amino acid sequence comprising residues 1 or about 28 to about 532 of Figure 
276 (SEQ ID NO:381). 

In another aspect, the invention concerns an isolated PR01287 polypeptide, comprising an amino acid 
sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 
prefeiably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 

15 sequence of amino acid residues 1 or about 28 to about 532, inclusive of Figure 276 (SEQ ID NO:381), 

In a further aspect, the invention concerns an isolated PROI287 polypeptide, comprising an amino acid 
sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence 
of residues 1 or about 28 to about 532, inclusive of Figure 276 (SEQ ID NO:381). 

20 In yet another aspect, the invention concerns an isolated PRO 1 287 polypeptide , comprising the sequence 

of amino acid residues 1 or about 28 to about 532, inclusive of Figure 276 (SEQ ID NO:381), or a fragment 
thereof sufficient to provide a binding site for an anti-PR01287 antibody. Preferably, the PR01287 fragment 
retains a qualitative biological activity of a native PRO 1287 polypeptide. 

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 

25 molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1287 polypeptide having the 
sequence of amino acid residues from about 1 or about 28 to about 532, inclusive of Figure 276 (SEQ ID 
NO:381), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about 
an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 
90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host 

30 cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) 
recovering the polypeptide from the cell culmre. 

In yet another embodiment, the invention concerns agonists and antagonists of a native PR01287 
polypeptide. In a particular embodiment, the agonist or antagonist is an anii-PR01287 antibody. 

In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 

35 native PRO 1287 polypeptide by contacting the native PRO 1287 polypeptide with a candidate molecule and 
monitoring a biological activity mediated by said polypeptide. 
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Inastill further embodiment, the invention concerns a composition comprising a PR012S7 polypeptide, 
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceutically acceptable carrier. 

121. PR01312 

A cDNA clone (DNA6 1873- 1574) has been identified that encodes a novel transmembrane polypeptide 
5 designated in the present application as "PR01312". 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DN A encoding 
a PR01312 polypeptide. 

In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 
preferably at least about 85% sequence identity, more preferably at least hboux 90% sequence identity, most 

1 0 preferably at least about 95 % sequence identity to (a) a DNA molecule encoding a PRO 1312 polypeptide having 
the sequence of amino acid residues from about 15 to about 212, inclusive of Figure 278 (SEQ ID NO:387), or 
(b) the complement of the DNA molecule of (a). 

In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01312 
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 49 and 

15 about 642, inclusive, of Figure 277 (SEQ ID NO:386). Preferably, hybridization occurs umler stringent 
hybridization and wash conditions. 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 
at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 

20 encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203132 
(DNA61873-1574), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 
acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 
Deposit No. 203132 (DNA61 873- 1574). 

In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 

25 encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
identity to the sequence of amino acid residues from about 15 to about 212, inclusive of Figure 278 (SEQ ID 
NO:387), or the complement of the DNA of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 

30 nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule 
under stringent conditions with (a) a DNA molecule encoding a PR01312 polypeptide having the sequence of 
amino acid residues from about 15 to about 212, inclusive of Figure 278 (SEQ ID NO:387), or (b) the 
complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, 
preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most 

35 preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule. 

in a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PR01312 polypeptide, with or without the N-terminal signal sequence and/or the initiating methionine, and 
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its soluble, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding 
nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 
1 through about amino acid position 14 in the sequence of Figure 278 (SEQ ID NO:387). The transmembrane 
domain has been tentatively identified as extending from about amino acid position 141 to about amino acid 
position 160 in the PR01312 amino acid sequence (Figure 278, SEQ ID NO:387). 
5 In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 

encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 15 to about 212, inclusive of Figure 278 (SEQ ID NO:387), or (b) the 
complement of the DNA of (a). 
10 Another embodiment is directed to fragments of a PR013 12 polypeptide coding sequence that may find 

use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 
preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 
nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length. 

In another embodiment, the invention provides isolated PR01312 polypeptide encoded by any of the 
15 isolated nucleic acid sequences hereinabove defined. 

In a specific aspect, the invention provides isolated native sequence PRO 13 12 polypeptide, which in one 
embodiment, includes an amino acid sequence comprising residues 15 to 212 of Figure 278 (SEQ ID NO:387), 
In another aspect, the invention concerns an isolated PRO 13 12 polypeptide, comprising an amino acid 
sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 
20 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
sequence of amino acid residues 15 to about 212. inclusive of Figure 278 (SEQ ID NO:387). 

in a further aspect, the invention concerns an isolated PR013 12 polypeptide, comprising an amino acid 
sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence 
25 of residues 15 to 212 of Figure 278 (SEQ ID NO:387). 

In yet another aspect^ the invention concerns an isolated PRO 1312 polypeptide . comprising the sequence 
of amino acid residues 15 to about 212, inclusive of Figure 278 (SEQ ID NO:387), or a fragment thereof 
sufficient to provide a binding site for an anii-PR0I312 antibody. Preferably, the PR01312 fragment retains 
a qualitative biological activity of a native PR01312 polypeptide. 
30 In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 

molecule under stringent conditions with (a) a DNA molecule encoding a PR01312 polypeptide having the 
sequence of amino acid residues from about 15 to about 212, inclusive of Figure 278 (SEQ ID N0:387), or (b) 
the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 807c sequence 
identity, preferably at least alx)ut an 85% sequence identity, more preferably at least about a 90% sequence 
35 identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising 
the lest DNA molecule tmder conditions suitable for expression of the polypeptide, and (iii) recovering the 
polypeptide from the cell culture. 
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122. PR01192 

A cDN A clone (DNA62814- 152 1 ) has been identified that encodes a novel polypepiidehaving homology 
to myelin PO protein and designated in the present application as *'PR01 192." 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DN A encoding 
a PR01192 polypeptide. 

5 In one aspect, the isolated nucleic acid comprises DNA having ai least about 80% sequence identity, 

preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PROl 192 polypeptide having 
the sequence of amino acid residues from about 22 to about 215, inclusive of Figure 280 (SEQ ID NO:389), or 
(b) the complement of the DNA molecule of (a). 
10 In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROl 192 

polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 184 and 
about 764. inclusive, of Figure 279 (SEQ ID NO:388). Preferably, hybridization occurs under stringent 
hybridization and wash conditions. 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 
15 at least about 80% sequence identity, preferably at least about 85 % sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same mamre polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203093 
(DNA62814-1521), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 
acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 
20 Deposit No. 203093 (DNA62814-1521). 

In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
idenuty to die sequence of amino acid residues from about 22 to about 215. inclusive of Figure 280 (SEQ ID 
25 NO:389), or die complement of die DNA of (a). 

In a further aspect, die invention concerns an isolated nucleic acid molecule having at least about 50 
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule 
under stringent conditions widi (a) a DNA molecule encoding a PROl 192 polypeptide having the sequence of 
amino acid residues from about 22 to about 215, inclusive of Figure 280 (SEQ ID NO:389), or (b) the 
30 complement of the DNA molecule of (a), and. if the DNA molecule has at least about an 80% sequence identity, 
preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most 
preferably at least about a 95% sequence identity to (a) or (b). isolating the test DNA molecule. 

In a specific aspect, die invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PROl 192 polypeptide, with or without the N-terminal signal sequence and/or the initialing mediionine. and 
35 its soluble, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding 
nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 
1 through about amino acid position 21 in the sequence of Figure 280 (SEQ ID NO:389). The transmembrane 
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domain has been tentatively identified as extending from about amino acid position 153 through about amino acid 
position 176 in the PR01192 amino acid sequence (Figure 280. SEQ ID NO:389). 

In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% jwsitives, most preferably at least about 95% positives when compared with the 
5 amino acid sequence of residues 22 to about 215, inclusive of Figure 280 (SEQ ID NO:389). or (b) the 
complement of the DNA of (a). 

Another embodiment is directed to fragments of a PRO 1 1 92 polypeptide coding sequence that may find 
use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 
preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 
10 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length. 

In another embodiment, the invention provides isolated PROl 192 polypeptide encoded by any of the 
isolated nucleic acid sequences hereinabove defined. 

In a specific aspect, the invention provides isolated native sequence PRO 1 1 92 polypeptide, which in one 
embodiment, includes an amino acid sequence comprising residues 22 to 215 of Figure 280 (SEQ ID NO:389). 
15 In another aspect, the invention concerns an isolated PROl 192 polypeptide, comprising an amino acid 

sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
sequence of amino acid residues 22 to about 215, inclusive of Figure 280 (SEQ ID NO:389). 

In a further aspect, the invention concerns an isolated PROl 192 polypeptide, comprising an amino acid 
20 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence 
of residues 22 to 215 of Figure 280 (SEQ ID NO:389). 

In yet another aspect, the invention concerns an isolated PRO 1 1 92 polypeptide, comprising the sequence 
of amino acid residues 22 to about 215, inclusive of Figure 280 (SEQ ID NO:389), or a fragment thereof 
25 sufficient to provide a binding site for an anti-PROl 192 antibody. Preferably, the PROl 192 fragment retains 
a qualitative biological activity of a native PROl 192 polypeptide. 

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 
molecule under stringent conditions with (a) a DNA molecule encoding a PROl 192 polypeptide having the 
sequence of amino acid residues from about 22 to about 215, inclusive of Figure 280 (SEQ ID NO:389), or (b) 
30 the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 
identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 
identity, most preferably at least about a 95% sequence identity to (a) or (b). (ii) culmring a host cell comprising 
the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the 
polypeptide from the cell culture. 
35 In yet another embodiment, the invention concerns agonists and antagonists of the a native PROl 192 

polypeptide. In a panicular embodiment, the agonist or antagonist is an anti-PROl 192 antibody. 
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In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 
native PR0n92 polypeptide, by contacting the native PR01192 polypeptide with a candidate molecule and 
monitoring a biological activity mediated by said polypeptide. 

In a still further embodiment, the invention concerns a composition comprising a PROI 192 polypeptide, 
or an agonist or anugonist as hereinabove defined, in combination with a pharmaceutically acceptable carrier 

5 

123. PROI 160 

A cDNA clone (DNA62872-I509) has been identified that encodes a novel secreted polypeptide, 
designated in the present application as 'PROI 160'. 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
10 a PROI 160 polypeptide. 

In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence idemiiy, 
preferably at least about 85% sequence idemity. more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PROI 160 polypeptide having 
the sequence of amino acid residues from about I or about 20 to about 90, inclusive of Figure 282 (SEQ ID 
15 NO:394), or (b) the complement of the DNA molecule of (a). 

In another aspect, the invemion concerns an isolated nucleic acid molecule encoding a PROI 160 
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 40 
or about 97 and about 309. inclusive, of Figure 282 (SEQ ID NO:394). Preferably, hybridization occurs under 
stringent hybridization and wash conditions. 
20 In a further aspect, the invemion concerns an isolated nucleic acid molecule comprising DNA having 

at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence idemity. most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same manire polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203100 
(DNA62872-1509) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodimem. the 
25 nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in 
ATCC Deposit No. 203100 (DNA62872-I509). 

In still a further aspect, the invention cotKcms an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least about 80% sequence idemity. preferably at least about 85% sequence 
idemity, more preferably at least about 90% sequence idemity, most pieferably at least about 95% sequence 
30 idemity to the sequence of amino acid residues I or abom 20 to about 90. inclusive of Figure 282 (SEQ ID 
NO:394), or (b) the complement of the DNA of (a). 

In a further aspect, the invemion concerns an isolated nucleic acid molecule having at least 100 
nucleotides and produced by hybridizing a test DNA molecule under stringem conditions with (a) a DNA 
molecule encoding a PROI 160 polypeptide having the sequence of amino acid residues from I or about 20 to 
35 about 90. inclusive of Figure 282 (SEQ ID NO:394). or (b) the complement of the DNA molecule of (a), and, 
if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence 
idemity. more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence 
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identity to (a) or (b). isolating the test DNA molecuJe. 

In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PROl 160 polypeptide, with or without the N-ierminal signal sequence and/or the initiating methionine, or is 
compiemeniary to such encoding nucleic acid molecule. The signal peptide has been lenutively identified as 
extending from about amino acid position 1 to about amino acid position 19 in the sequence of Figure 282 (SEQ 
ID NO:394). 

In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 1 or about 20 to about 90. inclusive of Figure 282 (SEQ ID NO:394). or (b) 
the complement of the DNA of (a). 

Another embodiment is directed to fragments of a PROl 160 polypeptide coding sequence that may find 
use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 
preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 
nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived 
from the nucleotide sequence shown in Figure 281 (SEQ ID NO:393). 

In another embodiment, the invention provides isolated PROl 160 polypeptide encoded by any of the 
isolated nucleic acid sequences hereinabove identified. 

In a specific aspect, the invention provides isolated native sequence PROl 160 polypeptide, which in 
certain embodiments, includes an amino acid sequence comprising residues 1 or about 20 to about 90 of Figure 
282 (SEQ ID NO:394). 

In another aspect, the invention concerns an isolated PRO 1 160 polypeptide, comprising an amino acid 
sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
sequence of amino acid residues 1 or about 20 to about 90, inclusive of Figure 282 (SEQ ID N0:394). 

In a further aspect, the invention concerns an isolated PROl 160 polypeptide, comprising an amino acid 
sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence 
of residues 1 or about 20 to about 90, inclusive of Figure 282 (SEQ ID NO:394). 

In yei another aspect, the invention concerns an isolated PRO 1 1 60 polypeptide , comprising the sequence 
of amino acid residues 1 or aboui 20 to about 90, inclusive of Figure 282 (SEQ ID NO:394), or a fragment 
thereof sufficient to provide a binding site for an anii-PROl 160 antibody. Preferably, the PR01160 fragmeni 
retains a qualitative biological activity of a native PROl 160 polypeptide. 

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 
molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1160 polypeptide having the 
sequence of amino acid residues from about 1 or about 20 to about 90. inclusive of Figure 282 (SEQ ID 
NO:394), or (b) the complemem of the DNA molecule of (a), and if die test DNA molecule has at least about 
an «)% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 
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90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b). (ii) culniring a host 
cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) 
recovering the polypeptide from the cell culnire. 

124. PRQ1187 

A cDNA clone (DNA62876-1517) has been identified that encodes a novel polypeptide having sequence 
identity with endo-beia'1.4-xylanase and designated in the present application as "PRO 11 87." 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PROl 187 polypeptide. 

In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO 1 187 polypeptide having 
the sequence of amino acid residues from about 18 to about 120, inclusive of Figure 284 (SEQ ID NO:399), or 
(b) the complement of the DNA molecule of (a). 

In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROl 187 
polypeptide comprising DNA hybridizing to the complemem of the nucleic acid between about residues 172 and 
about 480, inclusive, of Figure 283 (SEQ ID NO:398). Preferably, hybridization occurs under stringent 
hybridization and wash conditions. 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 
at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203095 
(DNA62876-1517), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 
acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDN A in ATCC 
Deposn No. 203095 (DNA62876-1517). 

in a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence idemity, most preferably at least about 95% sequence 
identity to the sequence of amino acid residues from about 18 to about 120, inclusive of Figure 284 (SEQ ID 
NO:399). or the complement of the DNA of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 
nucleotides, and preferably at least about 100 nucleotides nucleotides and produced by hybridizing a test DNA 
molecule under stringent conditions with (a) a DNA molecule encoding a PROl 187 polypeptide having the 
sequence of amino acid residues from about 18 to about 120. inclusive of Figure 284 (SEQ ID NO:399), or (b) 
the complement of the DNA molecule of (a), and. if the DNA molecule has at least about an 80% sequence 
identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 
idemity, most preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule. 
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In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a po]3^ptide scoring at least about 80% positives, preferably at least about S5% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 18 to about 120, inclusive of Figure 284 (SEQ ID NO:399), or (b) the 
complement of the DNA of (a). 
5 Another embcKliment is directed to fragments of a PROl 187 polypeptide coding sequence that may find 

use as hybridization probes. Such nucleic acid fragments are from about 20 through about 80 nucleotides in 
length, preferably from about 20 through about 60 nucleotides in length, more preferably from about 20 through 
about 50 nucleotides in length, and most preferably from about 20 through about 40 nucleotides in length. 

in another embodiment, the invention provides isolated PRO 11 87 polypeptide encoded by any of the 
10 isolated nucleic acid sequences hereinabove defined. 

In a specific aspect, the invention provides isolated native sequence PRO 1 1 87 polypeptide, which in one 
embodiment, includes an amino acid sequence comprising residues 18 through 120 of Figure 284 (SEQ ID 
NO:399). 

In another aspect, the invention concerns an isolated PROl 187 polypeptide, comprising an amino acid 
15 sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
sequence of amino acid residues 18 to about 120, inclusive of Figure 284 (SEQ ID NO:399). 

In a further aspect, the invention concerns an isolated PRO 1 1 87 polypeptide, comprising an amino acid 
sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
20 about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence 
of residues 18 through 120 of Figure 284 (SEQ ID NO: 399). 

In yet another aspect, the invention concerns an isolated PRO 1 1 87 polypeptide, comprising the sequence 
of amino acid residues 18 to about 120, inclusive of Figure 284 (SEQ ID NO:399), or a fragment thereof 
sufficient to provide a binding site for an anti-PROl 187 antibody. Preferably, the PROl 187 fragment retains 
25 a qualitative biological activity of a native PROl 187 polypeptide. 

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 
molecule under stringent conditions with (a) a DNA molecule encoding a PROl 187 polypeptide having the 
sequence of amino acid residues from about 18 to about 120, inclusive of Figure 284 (SEQ ID NO:399), or (b) 
the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 
30 identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 
identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising 
the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the 
polypeptide from the cell culture. 

In yet another embodiment, the invention concerns agonists and antagonists of the a native PROl 187 
35 polypeptide. In a particular embodiment, the agonist or antagonist is an anii-PR01187 antibody. 

In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 
native PROl 187 polypeptide, by contacting the native PROl 187 polypeptide with a candidate molecule and 
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moniioring a biological activity mediated by said polypeptide. 

Inastill further embodiment, the invention concerns a composition comprising a PROl 187 polypeptide, 
or an agonist or antagonist as hereinabove defmed, in combination with a pharmaceutically acceptable carrier. 

125. PROl 185 

5 A cDN A clone (DN A6288 1-1 5 1 5) has been identified that encodes a novel polypeptide having sequence 

idemity to a glucose repression regulatory protein, mpl. and designated in the present application as 
"PROl 185." 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PRO 1 185 polypeptide. 

10 In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 

preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO 1 185 polypeptide having 
the sequence of amino acid residues from about 22 to about 198. inclusive of Figure 286 (SEQ ID NO:401). or 
(b) the complement of the DNA molecule of (a). 

15 in another aspect, die invention concerns an isolated nucleic acid molecule encoding a PROl 185 

polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 67 and 
about 597. inclusive, of Figure 285 (SEQ ID NO:400). Preferably, hybridization occurs under stringent 
hybridization and wash conditions. 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 

20 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, mos! preferably at least about 95% sequence idemity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203096 
(DNA6288i-1515). or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 
acid comprises a DNA encoding die same mature polypeptide encoded by the human protein cDNA in ATCC 

25 Deposit No. 203096 (DNA62881 -1515), 

In a still fiuiher aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
identity to the sequence of amino acid residues from about 22 to about 198. inclusive of Figure 286 (SEQ ID 

30 NO:401), or the complement of the DNA of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 
nucleotides, and preferably at least about 100 nucleotides nucleotides and produced by hybridizing a test DNA 
molecule under stringent conditions with (a) a DNA molecule encoding a PR01I85 polypeptide having die 
sequence of amino acid residues from about 22 to about 198. inclusive of Figure 286 (SEQ ID N0:401), or (b) 

35 the complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence 
identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 
identity, most preferably at least about a 95% sequence identity to (a) or (b). isolating the test DNA molecule. 
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In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 22 to about 198. inclusive of Figure 286 (SEQ ID NO:401), or (b) the 
complement of the DNA of (a). 
5 Another embodiment is directed to fragments of a PRO 1 1 85 polypeptide coding sequence that may fmd 

use as hybridization probes. Such nucleic acid fragments are from about 20 through about 80 nucleotides in 
length, preferably from about 20 through about 60 nucleotides in length, more preferably from about 20 through 
about 50 nucleotides in length, and most preferably from about 20 through about 40 nucleotides in length. 

In another embodiment, the invention provides isolated PRO 11 85 polypeptide encoded by any of the 
10 isolated nucleic acid sequences hereinabove defmed. 

In a specific aspect, die invention provides isolated native sequence PRO 1 1 85 polypeptide, which in one 
embodiment, includes an amino acid sequence comprising residues 22 through 198 of Figure 286 (SEQ ID 
NO:401). 

In another aspect, the invention concerns an isolated PROl 185 polypeptide, comprising an amino acid 
IS sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
sequence of amino acid residues 22 to about 198, inclusive of Figure 286 (SEQ ID NO:401). 

In a further aspect, the invention concerns an isolated PROl 1 85 polypeptide, comprising an amino acid 
sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at leasi 
20 about 90 % positives, most preferably at least about 95 % positives when compared with the amino acid sequence 
of residues 22 through 198 of Figure 286 (SEQ ID NO:40l). 

In yet another aspect, the invention concerns an isolated PRO 1 1 85 polypeptide , comprising the sequence 
of amino acid residues 22 to about 198, inclusive of Figure 286 (SEQ ID NO:401), or a fragment thereof 
sufficient to provide a binding site for an anti-PROl 185 antibody. Preferably, the PROl 185 fragment retains 
25 a qualitative biological activity of a native PROl 185 polypeptide. 

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 
molecule imder stringent conditions with (a) a DNA molecule encoding a PRO 11 85 polypeptide having the 
sequence of amino acid residues from about 22 to about 198. inclusive of Figure 286 (SEQ ID N0:40l), or (b) 
the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 
30 identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 
identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising 
the lest DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the 
polypeptide from the cell culture. 

In yei another embodiment, the invention concerns agonists and antagonists of the a native PROl 185 
35 polypeptide. In a panicular embodiment, the agonist or antagonist is an anti-PROl 185 antibody. 

In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 
native PROl 185 polypeptide, by contacting the native PROl 185 polypeptide with a candidate molecule and 
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monitoring a biological activity mediated by said polypeptide. 

In a still further embodiment, the invention concerns a composition comprising a PROl 185 polypeptide, 
or an agonist or antagonist as hereinabove defmed, in combination with a pharmaceutically acceptable carrier. 

126. PR0134S 

5 A cDNA clone (DNA64852-1589) has been identified, having homology to nucleic acid encodbg 

tetranecun protein that encodes a novel polypeptide, designated in the present application as "PR01345". 

Id one embodiment, the invention provides an isolated nucleic acid molecule comprising DN A encoding 
a PRO 1345 polypeptide. 

In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 

10 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO 1345 polypeptide having 
the sequence of amino acid residues from about 1 or about 32 to about 206, inclusive of Figure 288 (SEQ ID 
NO:403), or (b) the complement of the DNA molecule of (a). 

In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1 345 

15 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 7 or 
about 100 and about 624, inclusive, of Figure 287 (SEQ ID NO:402). Preferably, hybridization occurs under 
stringent hybridization and wash conditions. 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 
at least about 80% sequence identity, preferably at least aboui 85% sequence identity, more preferably at least 

20 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203127 
(DNA64852-1589) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the 
nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in 
ATCC Deposit No. 203127 (DNA64852-1589). 

25 In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 

encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
identity to the sequence of amino acid residues 1 or about 32 to about 206, inclusive of Figure 288 (SEQ ID 
NO:403), or (b) the complement of the DNA of (a). 

30 In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 100 

nucleotides and produced by hybridizing a test DNA molecule under stringent conditions with (a) a DNA 
molecule encoding a PR01345 polypeptide having the sequence of amino acid residues from 1 or about 32 to 
about 206, inclusive of Figure 288 (SEQ ID NO:403), or (b) the complement of the DNA molecule of (a), and, 
if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence 

35 identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95 % sequence 
identity to (a) or (b), isolating the test DNA molecule. 
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In a sp^ecific aspect, the invention provides an isolated nucleic acid molecule comprising DN A encoding 
a PR01345 polypeptide, with or without the N*tenntnal signal sequence and/or the initiating methionine, or is 
complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as 
extending from about amino acid position 1 or amino acid 10 to about amino acid position 31 in the sequence 
of Figure 288 (SEQ ID NO:403). 
5 In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 

encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 1 or about 32 to about 206. inclusive of Figure 288 (SEQ ID NO:403), or (b) 
the complement of the DNA of (a). 

10 Another embodiment is directed to fragments of a PRO 1345 polypeptide coding sequence that may find 

use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length « 
preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 
nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived 
from the nucleotide sequence shown in Figure 287 (SEQ ID NO:402). 

15 In another embodiment, the invention provides isolated PRO 1345 polypeptide encoded by any of the 

isolated nucleic acid sequences hereinabove identified. 

In a specific aspect, the invention provides isolated native sequence PRO 1345 polypeptide, which in 
cenain embodiments, includes an amino acid sequence comprising residues 1 or about 32 to about 206 of Figure 
288 (SEQ ID NO:403). 

20 In another aspect, the invention concerns an isolated PRO 1 345 polypeptide, comprising an amino acid 

sequence having at least about 80% sequence identity, preferably ai least about 85% sequence identity, more 
preferably ai least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
sequence of amino acid residues 1 or about 32 to about 206, inclusive of Figure 288 (SEQ ID N0:403). 

In a further aspect, the invention concerns an isolated PRO 1 345 polypeptide, comprising an amino acid 

25 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence 
of residues 1 or about 32 to about 206, inclusive of Figure 288 (SEQ ID NO:403). 

In yet another aspect, the invention concerns an isolated PRO 1 345 polypeptide , comprising the sequence 
of amino acid residues 1 or about 32 to about 206, inclusive of Figure 288 (SEQ ID NO:403), or a fragment 

30 thereof sufficient to provide a binding site for an anti-PR01345 antibody. Preferably, the PR01345 fragment 
retains a qualitative biological activity of a native PRO 1345 polypeptide. 

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 
molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1345 polypeptide having the 
sequence of amino acid residues from about 1 or about 32 to about 206, inclusive of Figure 288 (SEQ ID 

35 NO:403), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about 
an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 
90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culiuring a host 
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cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) 
recovering the polypeptide from the cell culture. 

In yci another embodiment, the invention concerns agonists and antagonists of a native PR01345 
polypeptide. In a panicular embodiment, the agonist or antagonist is an anti-PR01345 antibody. 

In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 
5 native PR01345 polypeptide by contacting the native PR01345 polypeptide with a candidate molecule and 
monitoring a biological activity mediated by said polypeptide. 

In a stiJI further embodiment, the invention concerns a composition comprising a PR01345 polypeptide, 
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceutically acceptable carrier. 

10 127. PR01245 

A cDNA clone (DNA64884-1527) has been identified that encodes a novel secreted p>olypepiide 
designated in the present application as "PR01245.- 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PR01245 polypeptide. 

15 In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 

preferably at least about 85% sequence identity, more preferably ai least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO 1245 polypeptide having 
the sequence of amino acid residues from about 19 to about 104, inclusive of Figure 290 (SEQ ID NO:408), or 
(b) the complement of die DNA molecule of (a). 

20 In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01245 

polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 133 and 
about 390, inclusive, of Figure 289 (SEQ ID NO:407). Preferably, hybridization occurs under stringem 
hybridization and wash conditions. 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 

25 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203155 
(DNA64884-1245), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 
acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDN A in ATCC 

30 Deposit No. 203 155 (DNA64884-1245). 

In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
identity to the sequence of amino acid residues from about 19 to about 104, inclusive of Figure 290 (SEQ ID 

35 NO:408), or the complement of the DNA of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule 
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under stringent conditions with (a) a DNA molecule encoding a PR0124S polypeptide having the sequence of 
amino acid residues from about 19 to about 104, inclusive of Figure 290 (S£Q ID NO:408)» or (b) the 
complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, 
preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most 
preferably at least about a 95% sequence identity to (a) or (b). isolating the test DNA molecule. 
5 In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 

a PR01245 polypeptide, with or without the N-ierminal signal sequence and/or the initiating methionine, or is 
complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as 
extending from amino acid position 1 through about amino acid position 18 in the sequence of Figure 290 (SEQ 
ID NO:408). 

10 In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 

encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 19 to about 104, inclusive of Figure 290 (SEQ ID NO:408), or (b) the 
complement of the DNA of (a). 

15 Another embodiment is directed to fragments of a PR01245 polypeptide coding sequence that may find 

use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 
preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 
nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length. 

In another embodiment, the invention provides isolated PRO 1 245 polypeptide encoded by any of the 

20 isolated nucleic acid sequences hereinabove defmed. 

In a specific aspect, the invention provides isolated native sequence PRO 1 245 polypeptide, which in one 
embodiment, includes an amino acid sequence comprising residues 19 to 104 of Figure 290 (SEQ ID NO:408). 

In another aspect, the invention concerns an isolated PRO 1 245 polypeptide, comprising an amino acid 
sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 

25 preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
sequence of amino acid residues 19 to about 104, inclusive of Figure 290 (SEQ ID NO:408). 

In a further aspect, the invention concerns an isolated PR01245 polypeptide, comprising an amino acid 
sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence 

30 of residues 19 to 104 of Figure 290 (SEQ ID NO:408). 

In yet another aspect, the invention concerns an isolated PRO 1245 polypeptide, comprising the sequence 
of amino acid residues 19 to about 104, inclusive of Figure 290 (SEQ ID NO:408), or a fragment thereof 
sufficient to provide a binding site for an anti-PR01245 antibody. Preferably, the PR01245 fragment retains 
a qualitative biological activity of a native PRO 1 245 polypeptide. 

35 In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 

molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1245 polypeptide having the 
sequence of amino acid residues from about 19 to about 104, inclusive of Figure 290 (SEQ ID NO:408), or (b) 
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the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 
identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 
identity, most preferably at least about a 95 % sequence idemity to (a) or (b). (ii) culniring a host cell comprising 
the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the 
polypeptide from the cell culnire. 

5 

128. PR01358 

AcDNAclone(DNA64890-1612)hasbeenidemifiedtha,encodesanovelpolypeptidehavi„gsequence 
identity with RASP-1 and designated in the present application as "PR01358 " 

In one embodimem. the invcmion provides an isolated nucleic acid molecule comprising DNA encoding 
10 a PR01358 polypeptide. 

In one aspect, the isolated nucleic acid comprises DNA having ai least about 80% sequence idemity. 
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01358 polypeptide having 
the sequence of amino acid residues from about 19 to about 444, inclusive of Figure 292 (SEQ ID NO:4 10). or 
15 (b) the complement of the DNA molecule of (a). 

In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01358 
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 140 and 
about 1417. inclusive, of Figure 292 (SEQ ID NO:4IO). Preferably, hybridization occurs under stringem 
hybridization and wash conditions. 

20 In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 

at least abou. 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least abou. 95% sequence identity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203131 
(DNA64890- 1612), or (b) the complement of the DNA molecule of (a). In a preferred embodimem, the nucleic 

25 acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 
Deposit No. 203131 (DNA64890-1612). 

In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least about 80% sequence idemity. preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 

30 identity to the sequence of amino acid residues from about 19 to about 444. inclusive of Figure 292 (SEQ ID 
NO:410). or the complement of the DNA of (a). 

In a further aspect, the invemion concerns an isolated nucleic acid molecule having at least about 50 
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule 
under stringent conditions with (a) a DNA molecule encoding a PR01358 polypeptide having the sequence of 

35 amino acid residues from about 19 to about 444. inclusive of Figure 292 (SEQ ID NO:410). or (b) the 
complement of the DNA molecule of (a), and. if the DNA molecule has a. least about an 80% sequence identity, 
preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence idemity. most 
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preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule. 

In another aspect, the invenuon concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% posiiives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 19 to about 444, inclusive of Figure 292 (SEQ ID NO:410). or (b) the 
5 complement of the DNA of (a). 

Another embodiment is directed to fragments of a PR01358 polypeptide coding sequence that may find 
use as hybridization probes. Such nucleic acid fragments are from about 80 nucleotides to about 120 nucleotides 
in length. 

In another embodiment, the invention provides isolated PR01358 polypeptide encoded by any of the 
10 isolated nucleic acid sequences hereinabove defined. 

In a specific aspect, the invention provides isolated native sequence PR01358 polypeptide, which in one 
embodiment, includes an amino acid sequence comprising residues 19 dirough 444 of Figure 292 (SEQ ID 
NO:410). 

In another aspect, the invention concerns an isolated PR01358 polypeptide, comprising an amino acid 
15 sequence having at least about 80% sequence idemiiy, preferably at least about 85% sequence idemity, more 
preferably at least about 90% sequence idemity, most preferably at least about 95% sequence identity to the 
sequence of amino acid residues 19 to about 444, inclusive of Figure 292 (SEQ ID NO:410). 

In a further aspect, the invention concerns an isolated PRO 1 358 polypeptide, comprising an amino acid 
sequence scoring at least about 80% posiiives, preferably at leasi about 85% positives, more preferably at least 
20 about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence 
of residues 19 through 444 of Figure 292 (SEQ ID NO:410). 

in yet another aspect, the invemion concerns an isolated PRO 1 358 polypeptide . comprising the sequence 
of amino acid residues 19 to about 444. mclusive of Figure 292 (SEQ ID NO:410), or a fragment thereof 
sufficiem to provide a binding site for an anti-PR01358 antibody specific therefore. Preferably, the PR01358 
25 fragment retains a qualitative biological activity of a native PRO 1358 polypeptide. 

In a still further aspect, the invemion provides a polypeptide produced by (i) hybridizing a test DNA 
molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1358 polypeptide having the 
sequence of amino acid residues from about 19 to about 444, inclusive of Figure 292 (SEQ ID N0:410), or (b) 
the complement of the DNA molecule of (a), and if the test DNA molecule has at least abom an 80% sequence 
30 identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 
idemity, most preferably at least about a 95 % sequence idemiiy to (a) or (b), (ii) culniring a host cell comprising 
the test DNA molecule under conditions suiuble for expression of the polypeptide, and (iii) recovering the 
polypeptide from the cell culture. 

In yet another embodimem, the invemion concerns agonists and antagonists of a naiive PR01358 
35 polypeptide. In a panicular embodiment, the agonist or antagonist is an anti-PR01358 antibody. 

In a ftirther embodiment, the invention concerns a method of identifying agonists or antagonists of a 
naiive PR01358 polypeptide, by contacting the native PR01358 polypeptide with a candidate molecule and 
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monitoring a biological activity mediated by said po]ypq)tide. 

In a siill further embodiment, the invention concerns a composition comprising a PR01358 polypeptide, 
or an agonist or antagonist as hereinabove defmed, in combination with a pharmaceutically acceptable carrier. 

129. PR01195 

5 A cDN A clone (DNA654 1 2- 1 523) has been identified that encodes a novel polypeptide having sequence 

identity with a mouse proline rich acidic protein and designated in the present application as "PRO 1 195." 

In one embcxlimem. the invention provides an isolated nucleic acid molecule comprising DN A encoding 
a PR01195 polypeptide. 

In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 

10 preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PROl 195 polypeptide having 
the sequence of amino acid residues from about 23 to about 151, inclusive of Figure 294 (SEQ ID NO:412), or 
(b) the complement of the DNA molecule of (a). 

In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PROl 195 

15 polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 124 and 
about 510, inclusive, of Figure 293 {SEQ ID NO:411). Preferably, hybridization occurs under stringent 
hybridization and wash conditions. 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 
at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 

20 about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203094 
(DNA65412-1523), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 
acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 
Deposit No, 203094 (DNA65412-1523). 

25 In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 

encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
identity to the sequence of amino acid residues from about 23 to about 151, inclusive of Figure 294 (SEQ ID 
NO:412), or the complement of the DNA of (a). 

30 In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 

nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule 
under stringent conditions with (a) a DNA molecule encoding a PROl 195 polypeptide having die sequence of 
amino acid residues from about 23 to about 151, inclusive of Figure 294 (SEQ ID NO:412), or (b) the 
complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity. 

35 preferably at least about an 85% sequence identity, more preferably at leasi about a 90% sequence identity, most 
preferably at least about a 95% sequence identity to (a) or (b), isolating die test ON A molecule. 
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In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least atom 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 23 to about 151, inclusive of Figure 294 (SEQ ID NO:412). or (b) the 
complement of the DNA of (a). 
5 Another embodiment is directed to fragments of a PROl 195 polypeptide coding sequence thai may find 

use as hybridization probes. Such nucleic acid fragments are from about 20 through about 80 nucleotides in 
length, preferably from about 20 through about 60 nucleotides in length, more preferably from about 20 through 
about 50 nucleotides in length, and most preferably from about 20 through about 40 nucleotides in length. 

In another embodiment, die invention provides isolated PROl 195 polypeptide encoded by any of the 
10 isolated nucleic acid sequences hereinabove defined. 

In a specific aspect, the invention provides isolated native sequence PRO 1 1 95 polypeptide, which in one 
embodiment, includes an amino acid sequence comprising residues 23 through 151 of Figure 294 (SEQ ID 
NO:412). 

In another aspect, the invention concerns an isolated PROl 195 polypeptide, comprising an amino acid 
IS sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 

preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 

sequence of amino acid residues 23 to about 151, inclusive of Figure 294 (SEQ ID NO:412). 

In a further aspect, the invention concerns an isolated PRO 1 1 95 polypeptide . comprising an amino acid 

sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
20 about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence 

of residues 23 through 151 of Figure 294 (SEQ ID NO:412). 

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 

molecule imder stringent conditions with (a) a DNA molecule encoding a PROl 195 polypeptide having the 

sequence of amino acid residues from about 23 to about 151, inclusive of Figure 294 (SEQ ID NO:412). or (b) 
25 the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 

identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 

identity, most preferably at least about a 95% sequence identity to (a) or (b). (ii) culturing a host cell comprising 

the test DNA molecule imder conditions suitable for expression of the polypeptide, and (iii) recovering the 

polypeptide from the cell culnire. 
30 In yet another embodiment, the invention concerns agonists and antagonists of the a native PROl 195 

polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PROl 195 antibody. 

In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 

native PROl 195 polypeptide, by contacting the native PROl 195 polypeptide with a candidate molecule and 

monitoring a biological activity mediated by said polypeptide. 
35 In a still further embodiment, the invention concerns a composition comprising a PROl 195 polypeptide, 

or an agonist or antagonist as hereinabove defmed. in combination with a phannaceutically acceptable carrier. 
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130. PRO1270 

A cDNA clone (DNA66308-1537) has been identified, having homoiogy to nucleic acid encoding a 
lectin protein, that encodes a novel polypeptide, designated in the present application as "PROI270''. 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PRO1270 polypeptide. 

5 In one aspect^ the isolated nticleic acid comprises DNA having at least about 80% sequence identity, 

preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO 1270 polypeptide having 
the sequence of amino acid residues from about 1 or about 17 to about 313, inclusive of Figure 296 (SEQ ID 
NO:414), or (b) the complement of die DNA molecule of (a). 

10 In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO1270 

polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 103 
or about 151 and about 1041, inclusive, of Figure 295 (SEQ ID NO:413). Preferably, hybridization occurs 
under stringent hybridization and wash conditions. 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 

15 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203159 
(DNA66308-1537) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, die 
nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in 

20 ATCC Deposit No. 203 159 (DNA66308- 1537). 

In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
identity^ more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
identity to the sequence of amino acid residues 1 or about 17 to about 313» inclusive of Figure 296 (SEQ ID 

25 NO:4 14), or (b) the complement of the DNA of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 285 
nucleotides and produced by hybridizing a test DNA molecule under stringent conditions widi (a) a DNA 
molecule encoding a PRO1270 polypeptide having the sequence of amino acid residues from 1 or about 17 to 
about 313. inclusive of Figure 296 (SEQ ID NO:414), or (b) die complement of the DNA molecule of (a), and, 

30 if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence 
identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence 
identity to (a) or (b). isolating the test DNA molecule. 

In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PRO1270 polypeptide, with or without the N-terminal signal sequence and/or the initiating mediionine. or is 

35 complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as 
extending from about amino acid position 1 to about amino acid position 16 in the sequence of Figure 296 (SEQ 
ID NO:4l4). 
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In another aspect, the invention concerns an isolated nucleic acid molectUe comprismg (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% posHives when compared with the 
amino acid sequence of residues 1 or about 17 to about 313. inclusive of Figure 296 (SEQ ID NO:414). or (b) 
the complement of the DNA of (a). 

Another embodiment is directed to fragments of a PRO1270 polypeptide coding sequence that may find 
use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in lengd>. 
preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 
nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived 
from the nucleotide sequence shown in Figure 295 (SEQ ID NO:413). 

In another embodiment, the invention provides isolated PRO1270 polypeptide encoded by any of the 
isolated nucleic acid sequences hereinabove identified. 

In a specific aspect, the mvention provides isolated native sequence PRO 1270 polypeptide, which in 
certain embodiments, includes an amino acid sequence comprising residues 1 or about 17 to about 313 of Figure 
296 (SEQ ID NO:414). 

In another aspect, die invemion concerns an isohited PRO1270 polypeptide, comprising an amino acid 
sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to die 
sequence of amino acid residues 1 or about 17 to about 313. inclusive of Figure 296 (SEQ ID NO:414). 

In a further aspect, the invemion concerns an isolated PRO 1270 polypeptide, comprising an amino acid 
sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95% poshives when compared witfi die amino acid sequence 
of residues 1 or about 17 to about 313. inclusive of Figure 296 (SEQ ID N0:4I4). 

In yet anodier aspect, die invention concerns an isolated PRO1270 polypeptide, comprising die sequence 
of amino acid residues 1 or ab9ut 17 to about 313. inclusive of Figure 296 (SEQ ID NO:414). or a fragmem 
25 diereof sufficiem to provide a binding site for an anti-PRO1270 antibody. Preferably, die PROI270 fragment 
retains a qualiuuve biological activity of a native PRO1270 polypeptide. 

In a still fiinher aspect, die invention provides a polypeptide produced by (i) hybridizing a test DNA 
molecule under stringem conditions widi (a) a DNA molecule encoding a PRO1270 polypeptide having die 
sequence of amino acid residues from about 1 or about 17 to about 313. inclusive of Figure 296 (SEQ ID 
30 NO:4I4). or (b) die complemem of die DNA molecule of (a), and if die test DNA molecule has at least about 
an 80% sequence idemity. preferably at least about an 85% sequence identity, more preferably at least about a 
90% sequence identity, most preferably at least abow a 95% sequence idemity to (a) or (b). (ii) culturing a host 
cell comprising die test DNA molecule under conditions suiuble for expression of die polypeptide, and (iii) 
recovering die polypeptide from die cell culmre. 
35 In yet anodier embodimem, die invention concerns agonists and antagonists of a native PRO1270 

pirfypeptide. In a particular embodiment, die agonist or antagonist is an anti-PRO1270 antibody. 
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In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 
native PRO1270 polypeptide by contacting the native PRO1270 polypeptide with a candidate molecule and 
monitoring a biological activity mediated by said polypeptide. 

in a still further embodiment, the invention concerns a composition comprising a PRO 1270 polypeptide, 
or an agonist or antagonist as hereinabove defmed, in combination with a pharmaceutically acceptable carrier. 

5 

131. PR01271 

A cDNA clone (DNA66309-1538) has been identified thai encodes a novel polypeptide having serine 
and threonine rich regions designated in the present application as "PR01271" polypeptides. 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 

10 a PR01271 polypeptide. 

In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO 1271 polypeptide having 
the sequence of amino acid residues from about 32 to about 208, inclusive of Figure 298 (SEQ ID NO:416), or 

15 (b) the complement of die DNA molecule of (a). 

In anoUier aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 127 1 
polypeptide comprismg DNA hybridizing to the complement of the nucleic acid between about residues 187 and 
about 717, inclusive, of Figure 297 (SEQ ID NO:415). Preferably, hybridization occurs under stringem 
hybridization and wash conditions. 

20 In a funher aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 

at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203235 
(DNA66309-1538). or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 

25 acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 
Deposit No. 203235 (DNA66309-1538). 

In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 

30 identity to the sequence of amino acid residues from about 32 to about 208, inclusive of Figure 298 (SEQ ID 
NO:416), or the complement of the DNA of (a). 

In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule 
under stringent conditions with (a) a DNA molecule encoding a PRO 1271 polypeptide having die sequence of 

35 amino acid residues from about 32 to about 208. inclusive of Figure 298 (SEQ ID NO:416), or (b) die 
complement of the DNA molecule of (a), and. if the DNA molecule has at least about an 80% sequence identity, 
preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most 
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preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule. 

In a speciflc aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PR01271 polypeptide, with or without the N-terminal signal sequence and/or die initiating methionine, and 
its soluble, i.e. transmembrane domain deleted or inactivated variants, or is complementary to such encoding 
nucleic acid molecule. The signal peptide has been tentatively identified as extending from amino acid position 
5 1 through about amino acid position 31 in the sequence of Figure 298 (SEQ ID NO:416), The transmembrane 
domain has been tentatively identified as extending from about amino acid position 1 66 through about amino acid 
position 187 in the PR01271 amino acid sequence (Figure 298, SEQ ID NO:416). 

In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
10 preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 32 to about 208, inclusive of Figure 298 (SEQ ID NO:416), or (b) the 
complement of the DNA of (a). 

Another embodiment is directed to fragments of a PR01271 polypeptide coding sequence that may find 
use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 
15 preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 
nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length. 

In another embodiment, the invention provides isolated PRO 1 271 polypeptide encoded by any of the 
isolated nucleic acid sequences hereinabove defmed. 

In a specific aspect, the invention provides isolated native sequence PRO 1 27 1 polypeptide, which in one 
20 embodiment, includes an amino acid sequence comprising residues 32 through 208 of Figure 298 (SEQ ID 
NO:416). 

In another aspect, the invention concerns an isolated PRO 1 271 polypeptide, comprising an amino acid 
sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 

25 sequence of amino acid residues 32 to about 208, inclusive of Figure 298 (SEQ ID NO:416). 

In a further aspect, the invention concerns an isolated PRO 1271 polypeptide, comprising an amino acid 
sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95 % positives when compared with the amino acid sequence 
of residues 32 through 208 of Figure 298 (SEQ ID NO:416). 

30 In yet another aspect, the invention concerns an isolated PRO 1 27 1 polypeptide, comprising the sequence 

of amino acid residues 32 to about 208, inclusive of Figure 298 (SEQ ID NO:416), or a fragment thereof 
sufficient to provide a binding site for an anti-PROl271 antibody. Preferably, the PR01271 fragment retains 
a qualitative biological activity of a native PR01271 polypeptide. 

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 

35 molecule tmder stringent conditions with (a) a DNA molecule encoding a PR01271 polypeptide having the 
sequence of amino acid residues from about 32 to about 208, mclusive of Figure 298 (SEQ ID N0:4 16), or (b) 
the complement of die DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 
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identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 
identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host cell comprising 
the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the 
polypeptide from the cell culture. 

In yet another embodiment, the invention concerns agonists and antagonists of a native PR01271 
5 polypeptide. In a particular embodiment, the agonist or antagonist is an anii-PR01271 antibody. 

In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 
native PR01271 polypeptide, by contacting the native PR01271 polypeptide with a candidate molecule and 
monitoring a biological activity mediated by said polypeptide. 

In a still further embodiment, the invention concerns a composition comprising a PRO 1 27 1 polypeptide , 
10 or an agonist or antagonist as hereinabove defined, in combination with a pharmaceutically acceptable carrier. 

132. PR01375 

A cDNA clone (DNA67004-1614) has been identified that encodes a novel polypeptide having sequence 
identity with PUT2 and designated in the present application as ** PRO 1375.** 
15 In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 

a PR01375 polypeptide. 

In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PRO 1375 polypeptide having 

20 the sequence of amino acid residues from about I to about 198, inclusive of Figure 300 (SEQ ID NO:418). or 
(b) the complement of the DNA molecule of (a). 

In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PR01375 
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 104 and 
about 697. inclusive, of Figure 299 (SEQ ID NO:417). Preferably, hybridization occurs under stringent 

25 hybridization and wash conditions. 

in a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 
at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203115 

30 (DNA67004-1614), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 
acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in ATCC 
Deposit No. 203115 (DNA67004.1614). 

In a still further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 

35 identity, more preferably at least about 90% sequence identity, most preferably ai least about 95% sequence 
identity to the sequence of amino acid residues from about 1 to about 198, inclusive of Figure 300 (SEQ ID 
NO:418), or the complement of the DNA of (a). 
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In a funher aspect « the invention concerns an isoiated nucleic acid molecule having at least about 50 
nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule 
under stringent conditions with (a) a DNA molecule encoding a PRO 1375 polypeptide having the sequence of 
amino acid residues from about I to about 198. inclusive of Figure 300 (SEQ ID NO:418), or (b) the 
complement of the DNA molecule of (a), and. if the DNA molecule has at least about an 80% sequence identity. 
5 preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence identity, most 
preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule. 

In a specific aspect, die invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PR01375 polypeptide in its soluble form, i.e. transmembrane domains deleted or inactivated variants, or is 
complementary to such encoding nucleic acid molecule. The transmembrane domains have been tentatively 
10 identified as at about amino acid positions 1 1-28 (type II) and 103-125 of SEQ ID NO:4I8. 

In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
amino acid sequence of residues 1 to about 198, inclusive of Figure 300 (SEQ ID NO:418), or (b) the 
15 complement of the DNA of (a). ' 

Another embodiment is directed to fragments of a PRO 1375 polypeptide coding sequence that may find 
use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 
preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 
nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length. 
20 In another embodiment, the invention provides isolated PRO 1375 polypeptide encoded by any of the 

isolated nucleic acid sequences hereinabove defined. 

In a specific aspect, the invention provides isolated native sequence PRO 1375 polypeptide, which in one 
embodiment, includes an amino acid sequence comprising residues I through 198 of Figure 300 (SEQ ID 
NO:418). 

25 In another aspect, the invention concerns an isolated PRO 1375 polypeptide, comprising an amino acid 

sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
sequence of amino acid residues 1 to about 198, inclusive of Figure 300 (SEQ ID NO:418). 

In a further aspect, the invention concerns an isolated PRO 1375 polypeptide, comprising an amino acid 

30 sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95% positives when compared widi the amino acid sequence 
of residues 1 through 198 of Figure 300 (SEQ ID NO:418). 

In yet another aspect, the invention concerns an isolated PR01375 polypeptide, comprising the sequence 
of amino acid residues 1 to about 198, inclusive of Figure 300 (SEQ ID N0:418), or a fragment thereof 

35 sufficient to provide a binding site for an anti-PR01375 antibody. Preferably, the PR01375 fragment retains 
a qualitative biological activity of a native PRO 1 375 polypeptide. 
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In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 
molecule under stringent conditions with (a) a DNA molecule encoding a PR01375 polypeptide having the 
sequence of amino acid residues from about 1 to about 198, inclusive of Figure 300 (SEQ ID NO:418), or (b) 
the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 
identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 
identity, most preferably at least about a 95% sequence identity to (a) or (b). (ii) culturing a host cell comprising 
the lest DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the 
polypeptide from the ceil culture. 

In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1375 
polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PR01375 antibody. 

In a fiinher embodiment, the invention concerns a method of identifying agonists or antagonists of a 
native PR01375 polypeptide, by contacting the native PR01375 polypeptide with a candidate molecule and 
monitoring a biological activity mediated by said polypeptide. 

In a still further embodiment, the invention concerns a composition comprising a PRO 1 375 polypeptide , 
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceuiically acceptable carrier. 

133. PR01385 

A cDNA clone (DNA68869-1610) has been identified that encodes a novel secreted polypeptide, 
designated in the present application as ■PR01385". 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PRO 1385 polypeptide. 

In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 
preferably ai least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PROI385 polypeptide having 
the sequence of amino acid residues from about 1 or about 29 to about 128, inclusive of Figure 302 (SEQ ID 
NO:420), or (b) the complement of the DNA molecule of (a). 

In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1385 
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 26 
or about 1 10 and about 409, inclusive, of Figure 301 (SEQ ID NO:419). Preferably, hybridization occurs under 
stringent hybridization and wash conditions. 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 
at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same manire polypeptide encoded by the human protein cDNA in ATCC Deposit No. 203164 
(DNA68869-1610) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the 
nucleic acid comprises a DNA encoding the same manire polypeptide encoded by the human protein cDNA in 
ATCC Deposit No. 203164 (DNA68869-1610). 
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In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DN A 
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
identity to the sequence of amino acid residues 1 or about 29 to about 128, inclusive of Figure 302 (SEQ ID 
NO:420). or (b) the complement of the DNA of (a). 
5 In a further aspect, the invention concerns an isolated nucleic acid molecule having at least 245 

nucleotides and produced by hybridizing a lest DNA molecule under stringent conditions with (a) a DNA 
molecule encoding a PRO 1 385 polypeptide having the sequence of amino acid residues from 1 or about 29 to 
about 128. inclusive of Figure 302 (SEQ ID NO:420), or 0>) the complement of the DNA molecule of (a), and. 
if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence 

10 identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence 
identity to (a) or (b), isolating the test DNA molecule. 

In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PROI385 polypeptide, with or without the N-ierminal signal sequence and/or the initiating methionine, or is 
complementary to such encoding nucleic acid molecule. The signal peptide has been tentatively identified as 

15 extending from about amino acid position 1 to about amino acid position 28 in the sequence of Figure 302 (SEQ 
ID NO:420). 

In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
20 amino acid sequence of residues 1 or about 29 to about 128, inclusive of Figure 302 (SEQ ID NO:420), or (b) 
the complement of the DNA of (a). 

Another embodiment is directed to fragments of a PRO 1385 polypeptide coding sequence thai may fmd 
use as hybridization probes. Such nucleic acid fragments arc from about 20 to about 80 nucleotides in length, 
preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 
25 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived 
from the nucleotide sequence shown in Figure 301 (SEQ ID NO:4I9). 

In another embodiment, the invention provides isolated PR01385 polypeptide encoded by any of die 
isolated nucleic acid sequences hereinabove identified. 

In a specific aspect, the invention provides isolated native sequence PR01385 polypeptide, which in 
30 certain embodiments, includes an amino acid sequence comprising residues 1 or about 29 to about 128 of Figure 
302 (SEQ ID NO:420). 

In another aspect, the invention concerns an isolated PR01385 polypeptide, comprising an amino acid 
sequence having at least about 80% sequence identity, preferably at least about 85% sequence identity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to die 
35 sequence of amino acid residues 1 or about 29 to about 128, inclusive of Figure 302 (SEQ ID NO:420). 

In a funher aspect, the invention concerns an isolated PRO 1385 polypeptide, comprising an amino acid 
sequence scoring at least about 80% posuivcs. preferably at least about 85% positives, more preferably at least 
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about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence 
of residues 1 or about 29 to about 128, inclusive of Figure 302 (SEQ ID NO:420). 

In yet another aspect, the invention concerns an isolated PRO 1385 polypeptide, conq}rising the sequence 
of amino acid residues 1 or about 29 to about 128. inclusive of Figure 302 (SEQ ID NO:420), or a fragment 
thereof sufficient to provide a binding site for an anii-PR01385 antibody. Preferably, the PR0138S fragment 
5 retains a qualitative biological activity of a native PR01385 polypeptide. 

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 
molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1385 polypeptide having the 
sequence of amino acid residues from about 1 or about 29 to about 128, inclusive of Figure 302 (SEQ ID 
NO:420), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about 
10 an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 
90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b). (ii) culmring a host 
cell comprising the test DNA molecule under conditions suitable for expression of die polypeptide, and (iii) 
recovering the polypeptide from the cell culture. 

15 134. PR01387 

A cDNA clone (DNA68872-1620) has been identified, having homology to nucleic acid encoding 
myelin, that encodes a novel polypeptide, designated in the present application as "PR01387". 

In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PRO 1387 polypeptide. 

20 In one aspect, the isolated nucleic acid comprises DNA having ai least about 80% sequence identity, 

preferably ai least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01387 polypeptide having 
the sequence of amino acid residues from about 1 or about 20 to about 394, inclusive of Figure 304 (SEQ ID 
NO:422), or (b) the complement of the DNA molecule of (a). 

25 In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1387 

polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about nucleotides 85 
or about 142 and about 1266, inclusive, of Figure 303 (SEQ ID NO:421). Preferably, hybridization occurs 
tmder stringent hybridization and wash conditions. 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 

30 at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same mature polypeptide encoded by the human protein cDNA in ATCC Deposh No. 203160 
(DNA68872-1620) or (b) the complement of the nucleic acid molecule of (a). In a preferred embodiment, the 
nucleic acid comprises a DNA encoding the same mature polypeptide encoded by the human protein cDNA in 

35 ATCC Deposit No. 203160 (DNA68872-1620). 

In still a further aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
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identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
identity to the sequence of amino acid residues 1 or about 20 to about 394, inclusive of Figure 304 (SEQ ID 
NO:422), or (b) the complement of the DNA of (a). 

In a further as^ci, the invention concerns an isolated nucleic acid molecule having at least 395 
nucleoudes and produced by hybridizing a test DNA molecule under suingeni conditions with (a) a DNA 
5 molecule encoding a PRO 1387 polypeptide having the sequence of amino acid residues from 1 or about 20 to 
about 394, inclusive of Figure 304 (SEQ ID NO:422), or (b) the complement of the DNA molecule of (a), and. 
if the DNA molecule has at least about an 80 % sequence identity, prefereably at least about an 85% sequence 
identity, more preferably at least about a 90% sequence identity, most preferably at least about a 95% sequence 
identity to (a) or (b), isolating the test DNA molecule. 

10 In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 

a PR01387 polypeptide, with or without die N-terminal signal sequence and/or die initiating mediionine. and 
its soluble, i.e., transmembrane domain deleted or inactivated variants, or is complementary to such encoding 
nucleic acid molecule. The signal peptide has been tentatively identified as extending from about amino acid 
position 1 to about amino acid position 19 in the sequence of Figure 304 (SEQ ID NO:422). The transmembrane 

15 domain has been tentatively identified as extending from about amino acid position 275 to about amino acid 
position 296 in die PR01387 amino acid sequence (Figure 304, SEQ ID NO:422). 

In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with die 

20 amino acid sequence of residues 1 or about 20 to about 394, inclusive of Figure 304 (SEQ ID NO:422), or (b) 
the complement of the DNA of (a). 

Anodier embodiment is directed to fragments of a PRO 1387 polypeptide coding sequence that may find 
use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleoudes in lengdi, 
preferably from about 20 to about 60 nucleotides in lengdi. more preferably from about 20 to about 50 

25 nucleotides in length and most preferably from about 20 to about 40 nucleotides in length and may be derived 
from the nucleotide sequence shown in Figure 303 (SEQ ID NO:421). 

In anodier embodiment, die invention provides isolated PRO 1387 polypeptide encoded by any of die 
isolated nucleic acid sequences hereinabove idemiHed. 

In a specific aspect, the invention provides isolated native sequence PR01387 polypeptide, which in 

30 certain embodiments, includes an amino acid sequence comprising residues 1 or about 20 to about 394 of Figure 
304 (SEQ ID NO:422). 

In anodier aspect, die invention concerns an isolated PRO 1387 polypeptide, comprising an amino acid 
sequence having at least about 80% sequence idemity, preferably at least about 85% sequence identity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence idemity to die 
35 sequence of amino acid residues I or about 20 to about 394, inclusive of Figure 304 (SEQ ID NO:422). 

In a further aspect, die invention concerns an isolated PRO 1 387 polypeptide, comprising an amino acid 
sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
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about 90% positives, most preferably at least about 95% positives when compared with the amino acid sequence 
of residues 1 or about 20 to about 394. inclusive of Figure 304 (SEQ ID NO:422). 

In yet another aspect, the invention concerns an isolated PRO 1 387 polypeptide, comprising the sequence 
of amino acid residues 1 or about 20 to about 394. inclusive of Figure 304 (SEQ ID NO:422), or a fragment 
thereof sufficient to provide a binding site for an anii-PROi387 antibody. Preferably, the PR01387 fragment 
5 retains a qualitative biological activity of a native PR01387 polypeptide. 

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 
molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1387 polypeptide having the 
sequence of amino acid residues from about 1 or about 20 to about 394, inclusive of Figure 304 (SEQ ID 
NO:422), or (b) the complement of the DNA molecule of (a), and if the test DNA molecule has at least about 
10 an 80% sequence identity, preferably at least about an 85% sequence identity, more preferably at least about a 
90% sequence identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culturing a host 
cell comprising the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) 
recovering the polypeptide from the cell culture. 

In yet another embodiment, the invention concerns agonists and antagonists of a native PRO 1387 
15 polypeptide. In a particular embodiment, the agonist or antagonist is an anti-PROI387 antibody. 

In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 
native PRO 1387 polypeptide by contacting the native PRO 1387 polypeptide with a candidate molecule and 
monitoring a biological activity mediated by said polypeptide. 

In a still further embodiment, the invention concerns a composition comprising a PRO 1387 polypeptide, 
20 or an agonist or antagonist as hereinabove defined, in combination with a pharmaceutically acceptable carrier. 

135. PR01384 

A cDNA clone, referred to herein as **DNA71 159**, has been identified that encodes a novel polypeptide 
having homology to NKG2-D protein designated in the present application as '*PR01384**. 
25 In one embodiment, the invention provides an isolated nucleic acid molecule comprising DNA encoding 

a PR01384 polypeptide. 

In one aspect, the isolated nucleic acid comprises DNA having at least about 80% sequence identity, 
preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, most 
preferably at least about 95% sequence identity to (a) a DNA molecule encoding a PR01384 polypeptide having 
30 the sequence of amino acid residues from about 1 to about 229, inclusive of Figure 306 (SEQ ID N0:424), or 
(b) die complement of the DNA molecule of (a). 

In another aspect, the invention concerns an isolated nucleic acid molecule encoding a PRO 1384 
polypeptide comprising DNA hybridizing to the complement of the nucleic acid between about residues 182 and 
about 868, inclusive, of Figure 305 (SEQ ID NO:423). Preferably, hybridization occurs under stringent 
35 hybridization and wash conditions. 

In a further aspect, the invention concerns an isolated nucleic acid molecule comprising DNA having 
at least about 80% sequence identity, preferably at least about 85% sequence identity, more preferably at least 
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about 90% sequence identity, most preferably at least about 95% sequence identity to (a) a DNA molecule 
encoding the same manire polypeptide encoded by the human protein cDNA in ATCC Deposu No. 203135 
(DNA71 159-1617), or (b) the complement of the DNA molecule of (a). In a preferred embodiment, the nucleic 
acid comprises a DNA encoding the same mature polypepude encoded by the human protein cDNA in ATCC 
Deposit No. 203135 (DNA7 1159-1617). 
5 In a still iurther aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 

encoding a polypeptide having at least about 80% sequence identity, preferably at least about 85% sequence 
identity, more preferably at least about 90% sequence identity, most preferably at least about 95% sequence 
identity to the sequence of amino acid residues from about 1 to about 229, inclusive of Figure 306 (SEQ ID 
NO:424), or the complement of the DNA of (a). 

10 In a further aspect, the invention concerns an isolated nucleic acid molecule having at least about 50 

nucleotides, and preferably at least about 100 nucleotides and produced by hybridizing a test DNA molecule 
under stringent conditions with (a) a DNA molecule encoding a PRO 1384 polypeptide having the sequence of 
amino acid residues from about 1 to about 229, inclusive of Figure 306 (SEQ ID NO:424), or (b) the 
complement of the DNA molecule of (a), and, if the DNA molecule has at least about an 80% sequence identity, 

IS preferably at least about an 85 % sequence identity, more preferably at least about a 90% sequence identity, most 
preferably at least about a 95% sequence identity to (a) or (b), isolating the test DNA molecule. 

In a specific aspect, the invention provides an isolated nucleic acid molecule comprising DNA encoding 
a PR01384 polypeptide with its transmembrane domain deleted or inactivated, or is complementary to such 
encoding nucleic acid molecule. The transmembrane domain has been tentatively identified as extending from 

20 about amino acid position 32 through about amino acid position 57 in the PRO 1 384 amino acid sequence (Figure 
306. SEQ ID NO:424). 

In another aspect, the invention concerns an isolated nucleic acid molecule comprising (a) DNA 
encoding a polypeptide scoring at least about 80% positives, preferably at least about 85% positives, more 
preferably at least about 90% positives, most preferably at least about 95% positives when compared with the 
25 amino acid sequence of residues 1 to about 229. inclusive of Figure 306 (SEQ ID NO:424), or (b) the 
complement of the DNA of (a). 

Another embodiment is directed to fragments of a PRO 1384 polypeptide coding sequence that may find 
use as hybridization probes. Such nucleic acid fragments are from about 20 to about 80 nucleotides in length, 
preferably from about 20 to about 60 nucleotides in length, more preferably from about 20 to about 50 
30 nucleotides in length, and most preferably from about 20 to about 40 nucleotides in length. 

In another embodiment, the invention provides isolated PRO 1384 polypeptide encoded by any of the 
isolated nucleic acid sequences hereinabove defmed. 

In a specific aspect, the invention provides isolated native sequence PR01384 polypeptide, which in one 
embodiment, includes an amino acid sequence comprising residues I to 229 of Figure 306 (SEQ ID NO:424). 
35 In another aspect, the invention concerns an isolated PR01384 polypeptide, comprising an amino acid 

sequence having at least about 80% sequence identity, preferably at least about 85% sequence idemity, more 
preferably at least about 90% sequence identity, most preferably at least about 95% sequence identity to the 
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sequence of amino acid residues 1 to about 229, inclusive of Figure 306 (SEQ ID NO:424). 

In a further aspect, the invention concerns an isolated PR01384 polypeptide, comprising an amino acid 
sequence scoring at least about 80% positives, preferably at least about 85% positives, more preferably at least 
about 90% positives, most preferably at least about 95% positives compared with the amino acid sequence 
of residues 1 to 229 of Figure 306 (SEQ ID NO:424). 
5 In yet another aspect, the invention concerns an isolated PRO 1 384 polypeptide, comprising the sequence 

of amino acid residues 1 to about 229, inclusive of Figure 306 (SEQ ID NO:424), or a fragment thereof 
sufficient to provide a binding site for an anti-PR01384 antibody. Preferably, the PR01384 fragment retains 
a qualitative biological activity of a native PRO 1384 polypeptide. 

In a still further aspect, the invention provides a polypeptide produced by (i) hybridizing a test DNA 
10 molecule under stringent conditions with (a) a DNA molecule encoding a PRO 1384 polypeptide having the 
sequence of amino acid residues from about 1 to about 229, inclusive of Figure 306 (SEQ ID NO:424), or (b) 
the complement of the DNA molecule of (a), and if the test DNA molecule has at least about an 80% sequence 
identity, preferably at least about an 85% sequence identity, more preferably at least about a 90% sequence 
identity, most preferably at least about a 95% sequence identity to (a) or (b), (ii) culmring a host cell comprising 
IS the test DNA molecule under conditions suitable for expression of the polypeptide, and (iii) recovering the 
polypeptide from the cell cultiu-e. 

In yet another embodiment, the invention concerns agonists and antagonists of a native PROi384 
polypeptide. In a particular embodiment, the agonist or antagonist is an anii-PR01384 antibody. 

In a further embodiment, the invention concerns a method of identifying agonists or antagonists of a 
20 native PR01384 polypeptide, by contacting the native PRO 1384 polypeptide with a candidate molecule and 
monitoring a biological activity mediated by said polypeptide . 

In a still further embodiment, the invention concerns a composiiion comprising a PRO 1384 polypeptide, 
or an agonist or antagonist as hereinabove defined, in combination with a pharmaceutically acceptable carrier. 

25 136. Additional Embodiments 

In other embodiments of the present invention, the invention provides vectors comprising DNA 
encoding any of the above or below described polypeptides. A host cell comprising any such vector is also 
provided. By way of example, the host cells may be CHO cells, £. coli, or yeast. A process for producing any 
of the above or below described polypeptides is further provided and comprises culturing host cells under 

30 conditiorts suitable for expression of the desired polypeptide and recovering the desired polypeptide from the cell 
citlture. 

In other embodiments, the invention provides chimeric molecules comprising any of the above or below 
described polypeptides fused to a heterologous polypeptide or amino acid sequence. An example of stich a 
chimeric molecule comprises any of the above or below described polypeptides fused to an epitope tag sequence 
35 or a Fc region of an immunoglobulin. 

In another embodiment, the invention provides an antibody which specifically binds to any of the above 
or below described polypeptides. Optionally, the antibody is a monoclonal antibody. 
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In yei other embodiments, the invention provides oligonucleotide probes useful for isolating genomic 
and cDN A nucleotide sequences, wherein those probes may be derived from any of the above or below described 
nucleotide sequences. 

In other embodiments, the invention provides an isolated nucleic acid molecule comprising a nucleotide 
sequence that encodes a PRO polypeptide. 
5 In one aspect, the isolated nucleic acid molecule comprises a nucleotide sequence having at least about 

80% sequence idenuty, preferably at least about 81% sequence identity, more preferably at least about 82% 
sequence identity, yet more preferably at least about 83% sequence identity, yet more preferably at least about 
84% sequence identity, yet more preferably at least about 85% sequence identity, yet more preferably at least 
about 86% sequence identity, yet more preferably at least about 87% sequence identity, yet more preferably at 

10 least about 88% sequence identity, yet more preferably at least about 89% sequence identity, yet more preferably 
at least about 90% sequence identity, yet more preferably at least about 91% sequence identity, yet more 
preferably at least about 92% sequence identity, yet more preferably at least about 93% sequence identity, yet 
more preferably at least about 94% sequence identity, yet more preferably at least about 95 % sequence identity, 
yet more preferably at least about 96% sequence idenuty, yet more preferably at least abcmi 97% sequence 

15 identity, yet more preferably at least about 98% sequence identity and yet more preferably at least about 99% 
sequence identity to (a) a DNA molecule encoding a PRO polypeptide having a full-length amino acid sequence 
as disclosed herein, a full-length amino acid sequence lacking the signal peptide as disclosed herein or an 
extracellular domain of a transmembrane protein as disclosed herein, or (b) the complement of the DNA 
molecule of (a). 

20 In other aspects, the isolated nucleic acid molecule comprises a nucleotide sequence having at least about 

80% sequence identity, preferably at least about 81% sequence identity, more preferably at least about 82% 
sequence identity, yet more preferably at least about 83% sequence identity, yet more preferably at least about 
84% sequence identity, yet more preferably at least about 85% sequence identity, yet more preferably at least 
about 86% sequence identity, yet more preferably at least about 87% sequence identity, yet more preferably at 

25 least about 88% sequence identity, yet more preferably at least about 89% sequence identity, yet more preferably 
at least about 90% sequence identity, yet more preferably at least about 91% sequence identity, yet more 
preferably at least about 92% sequence identity, yet more preferably at least about 93% sequence identity, yet 
more preferably at least about 94% sequence idemity, yet more preferably at least about 95% sequence identity, 
yet more preferably at least about 96% sequence identity, yet more preferably at least about 97% sequence 

30 identity, yet more preferably at least about 98% sequence identity and yet more preferably at least about 99% 
sequence identity to (a) a DNA molecule having the coding sequence of a ftjU-length PRO polypeptide cDN A 
as disclosed herein, the coding sequence of a full-length PRO polypeptide lacking the signal peptide as disclosed 
herein or the coding sequence of an extracellular domain of a transmembrane PRO polypeptiude as disclosed 
herien, or (b) the complement of the DNA molecule of (a). 

35 In a further aspect, the invention concerns an isolated nucleic acid molecule comprising 

a nucleotide sequence having at least about 80% sequence idemity, preferably at least about 81% sequence 
identity, more preferably at least about 82% sequence identity, yet more preferably at least about 83% sequence 
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identity, yet more preferably at least about 84% sequence identity, yet more preferably at least about 85% 
sequence identity, yet more preferably at least about 86% sequence identity, yet more preferably at least about 
87% sequence identity, yet more preferably at least about 88% sequence identity, yet more preferably at least 
about 89% sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at 
least about 91 % sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably 
5 at least about 93% sequence identity, yet more preferably ai least about 94% sequence identity, yet more 
preferably at least about 95% sequence identity, yet more preferably at least about 96% sequence identity, yet 
more preferably at least about 97% sequence identity, yet more preferably at least about 98% sequence identity 
and yet more preferably at least about 99% sequence identity to (a) a DNA molecule that encodes the same 
manire polypeptide encoded by any of the human protein cDN As deposited with the ATCC as disclosed herein. 

10 or (b) the complement of the DNA molecule of (a). 

Another aspect the invention provides an isolated nucleic acid molecule comprising a nucleotide 
sequence encoding a PRO polypeptide which is either transmembrane domain-deleted or transmembrane domain- 
inactivated, or is complementary to such encoding nucleotide sequence, wherein the transmembrane domain(s) 
of such polypeptide are disclosed herein. Therefore, soluble extracellular domains of the herein described PRO 

IS polypeptides are contemplated. 

Another embodiment is directed to fragments of a PRO polypeptide coding sequence that may fmd use 
as, for example, hybridization probes or for encoding fragments of a PRO polypeptide that may optionally 
encode a polypeptide comprising a binding site for an anii-PRO antibody. Such nucleic acid fragments are 
usually at least about 20 nucleotides in length, preferably at least about 30 nucleotides in length, more preferably 

20 at least about 40 nucleotides in length, yet more preferably at least about 50 nucleotides in length, yet more 
preferably at least about 60 nucleotides in length, yet more preferably at least about 70 nucleotides in length, yet 
more preferably at least about 80 nucleotides in length, yet more preferably at least about 90 nucleotides in 
length, yet more preferably at least about 100 nucleotides in length, yet more preferably at least about 1 10 
nucleotides in length, yet more preferably at least about 120 nucleotides in length, yet more preferably at least 

25 about 1 30 nucleotides in length, yet more preferably at least about 1 40 nucleotides in length, yet more preferably 
at least about 150 nucleotides in length, yet more preferably at least about 160 nucleotides in length, yet more 
preferably at least about 170 nucleotides in length, yet more preferably at least about 180 nucleotides in length, 
yet more preferably at least about 190 nucleotides in lengdi, yet more preferably at least about 200 nucleotides 
in length, yet more preferably at least about 250 nucleotides in length, yet more preferably at least about 300 

30 nucleotides in length, yet more preferably at least about 350 nucleotides in lengdi, yet more preferably at least 
about 400 nucleotides in length, yet more preferably at least about 450 nucleotides in length, yet more preferably 
at least about 500 nucleotides in length, yet more preferably at least about 600 nucleotides in length, yet more 
preferably at least about 700 nucleotides in length, yet more preferably at least about 800 nucleotides in length, 
yet more preferably at least about 900 nucleotides in length and yet more preferably at least about 1000 

35 nucleotides in length, wherein in this context the term *'aboui*' means the referenced nucleotide sequence length 
plus or minus 10% of that referenced length. It is noted that novel fragments of a PRO polypeptide-encoding 
nucleotide sequence may be determined in a routine manner by aligning the PRO polypepiide-cncoding nucleotide 
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sequence wiih other known nucleotide sequences using any of a number of well known sequence alignment 
programs and determining which PRO polypeptide-encoding nucleotide sequence fragment(s) are novel. All of 
such PRO polypeptide-encoding nucleotide sequences are contemplated herein. Also contemplated are the PRO 
polypeptide fragments encoded by these nucleotide molecule fragments, preferably those PRO polypeptide 
fragments that comprise a binding site for an anti*PRO antibody. 
5 In anodier embodiment, the invention provides isolated PRO polypeptide encoded by any of the isolated 

nucleic acid sequences hereinabove identified. 

In a certain aspect, the invention concerns an isolated PRO polypeptide, comprising an amino acid 
sequence having at least about 80% sequence identity, preferably at least about 81% sequence identity, more 
preferably at least about 82% sequence identity, yet more preferably at least about 83% sequence identity, yet 

10 more preferably at lease about 84% sequence identity, yet more preferably at least about 85% sequence identity, 
yet more preferably at least about 86% sequence identity, yet more preferably at least about 87% sequence 
identity, yet more preferably at least about 88% sequence identity, yet more preferably at least about 89% 
sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least about 
91% sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably at least 

IS about 93% sequence identity, yet more preferably at least about 94% sequence identity, yet more preferably at 
least about 95 % sequence identity, yet more preferably at least about 96% sequence identity, yet more preferably 
at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more 
preferably at least about 99% sequence identity to a PRO polypeptide having a full-length amino acid sequence 
as disclosed herein, a fiilMength amino acid sequence lacking the signal peptide as disclosed herein or an 

20 extracellular domain of a transmembrane protein as disclosed herein. 

In a further aspect, the invention concerns an isolated PRO polypeptide comprising an amino acid 
sequence having at least about 80% sequence identity, preferably ai least about 81% sequence identity, more 
preferably at least about 82% sequence identity, yet more preferably at least about 83% sequence identity, yet 
more preferably at least about 84% sequence identity, yet more preferably at least about 85% sequence identity, 

25 yet more preferably at least about 86% sequence identity, yet more preferably at least about 87% sequence 
identity, yet more preferably at least about 88% sequeiKe identity, yet more preferably at least about 89% 
sequence identity, yet more preferably at least about 90% sequence identity, yet more preferably at least about 
91 % sequence identity, yet more preferably at least about 92% sequence identity, yet more preferably at least 
about 93% sequence identity, yet more preferably at least about 94% sequence identity, yet more preferably at 

30 least about 95 % sequence identity, yet more preferably at least about 96% sequence identity, yet more preferably 
at least about 97% sequence identity, yet more preferably at least about 98% sequence identity and yet more 
preferably at least about 99% sequence idemity to an amino acid sequence encoded by any of the human protein 
cDNAs deposited with the ATCC as disclosed herein. 

In a further aspect, the invention concerns an isolated PRO polypeptide comprising an amino acid 

35 sequence scoring at least about 80% poshives, preferably at least about 81 % positives, more preferably at least 
about 82% positives, yet more preferably at least about 83% positives, yet more preferably at least about 84% 
positives, yet more preferably at least about 85% positives, yet more preferably at least about 86% positives. 
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yet more preferably at least about 87% positives, yet more preferably at least about 88% positives, yet more 
preferably at least about 89% positives, yet more preferably at least about 90% positives, yet more preferably 
at least about 91 % positives, yet more preferably at least about 92% positives, yet more preferably at least about 
93% positives, yet more preferably at least about 94% positives, yet more preferably at least about 95% 
positives, yet more preferably at least about 96% positives, yet more preferably at least about 97% positives, 
5 yet more preferably at least about 98% positives and yet more preferably at least about 99% positives when 
compared with the amino acid sequence of a PRO polypeptide having a full-length amino acid sequence as 
disclosed herein, a full-length amino acid sequence lacking the signal peptide as disclosed herein or an 
extracellular domain of a transmembrane protein as disclosed herein. 

In a specific aspect, the invention provides an isolated PRO polypeptide without the N-terminal signal 

10 sequence and/or the initiating methionine and is encoded by a nucleotide sequence that encodes such an amino 
acid sequence as hereinbefore described. Processes for producing the same are also herein described, wherein 
those processes comprise culturing a host cell comprising a vector which conq>rises the appropriate encoding 
nucleic acid molecule imder conditions suitable for expression of the PRO polypeptide and recovering the PRO 
polypeptide from the cell culture. 

IS Another aspect the invention provides an isolated PRO polypeptide which is either transmembrane 

domain-deleted or transmembrane domain-inactivated. Processes for producing the same are also herein 
described, wherein those processes comprise culturing a host cell comprising a vector which comprises the 
appropriate encoding nucleic acid molecule under conditions suitable for expression of the PRO polypeptide and 
recovering the PRO polypeptide from the cell culmre. 

20 Another embodiment of the present invention is directed to the use of a PRO polypeptide, or an agonist 

or antagonist thereof as hereinbefore described, or an anii-PRO antibody, for the preparation of a medicament 
useful in the treatment of a condition which is responsive to the PRO polypeptide, an agonist or antagonist 
thereof or an anti-PRO antibody. 

25 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows a nucleotide sequence (SEQ ID NO: 1) of a native sequence PR0281 (UNQ244) cDNA, 

wherein SEQ ID NO:l is a clone designated herein as "DNA 16422- 1209". 

Figure 2 shows the amino acid sequence (SEQ ID N0:2) derived from the coding sequence of SEQ ID 

N0:1 shown in Figure 1. 

30 Figure 3 shows a nucleotide sequence (SEQ ID N0:5) of a native sequence PR0276 (UNQ243) cDN A, 

wherein SEQ ID N0:5 is a clone designated herein as "DNA 16435- 1208". 

Figure 4 shows the amino acid sequence (SEQ ID NO:6) derived from the coding sequence of SEQ ID 
N0:5 shown in Figure 3. 

Figure 5 shows a nucleotide sequence (SEQ ID N0:7) of a native sequence PROI89 (UNQ163) cDNA. 
35 wherein SEQ ID N0:7 is a clone designated herein as "DNA2 1642- 1391". 

Figure 6 shows the amino acid sequence (SEQ ID NO: 8) derived from the coding sequence of SEQ ID 
NO: 7 shown in Figure 5. 
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Figure 7 shows a nucleotide sequence designated herein as DNA]4187 (S£Q ID NO:9). 

Figure 8 ^ows a nucleotide sequence (SEQ ID NO: 13) of a native sequence PRO 1 90 (UNQ164) cDN A, 
wherein SEQ ID NO: 13 is a clone designated herein as ''DNA23334-1392". 

Figure 9 shows the amino acid seqi^nce (SEQ ID NO: 14) derived from the coding sequence of SEQ 
ID NO: 13 shown in Figure 8. 
5 Figure 10 shows a nucleotide sequence designated herein as DNA14232 (SEQ ID NO: 15). 

Figure 11 shows a nucleotide sequence (SEQ ID NO: 19) of a native sequence PR0341 (UNQ300) 
cDNA, wherein SEQ ID NO: 19 is a clone designated herein as '•DNA26288-1239". 

Figure 12 shows the amino acid sequence (SEQ ID NO:20) derived from the coding sequence of SEQ 
ID NO: 19 shown in Figure 11. 
10 Figure 13 shows a nucleotide sequence designated herein as DNAI2920 (SEQ ID NO:21). 

Figure 14 shows a nucleotide sequence (SEQ ID NO:22) of a native sequence PRO180 (UNQ154) 
cDNA, wherein SEQ ID NO:22 is a clone designated herein as ''DNA26843-1389". 

Figure 15 shows the amino acid sequence (SEQ ID NO:23) derived from the coding sequence of SEQ 
ID UO:21 shown in Figure 14. 
15 ' Figure 16 shows a nucleotide sequence designated herein as DNA12922 (SEQ ID NO:24). 

Figure 17 shows a nucleotide sequence (SEQ ID NO:27) of a native sequence PR0194 (UNQ168) 
cDNA, wherein SEQ ID NO:27 is a clone designated herein as 'DNA26844-1394". 

Figure 18 shows the amino acid sequence (SEQ ID NO:28) derived from the coding sequence of SEQ 
ID NO:27 shown in Figure 17. 
20 Figure 19 shows a nucleotide sequence (SEQ ID NO:29) of a native sequence PRO203 (UNQ177) 

cDNA, wherein SEQ ID NO:29 is a clone designated herein as "DNA30862-1396". 

Figure 20 shows the amino acid sequence (SEQ ID NO:30) derived from the coding sequence of SEQ 
ID NO:29 shown in Figure 19. 

Figure 21 shows a nucleotide sequence designated herein as DNA15618 (SEQ ID N0:31). 
25 Figure 22 shows a nucleotide sequence (SEQ ID NO:32) of a native sequence PRO290 (UNQ253) 

cDNA, wherein SEQ ID NO:32 is a clone designated herein as *DNA35680-1212". 

Figure 23 shows the amino acid sequence (SEQ ID NO:33) derived from the coding sequence of SEQ 
ID NO:32 shown in Figure 22. 

Figure 24 shows a nucleotide sequence (SEQ ID NO:35) of a native sequence PR0874 (UN(J441) 
30 cDNA, wherein SEQ ID NO:35 is a clone designated herein as "DNA40621-1440". 

Figure 25 shows the amino acid sequence (SEQ ID NO:36) derived from the coding sequence of SEQ 
ID NO:35 shown in Figure 24. 

Figure 26 shows a nucleotide sequence (SEQ ID N0:40) of a native sequence PRO710 (UNQ374) 
cDNA, wherein SEQ ID N0:40 is a clone designated herein as 'DNA44161-1434". 
35 Figure 27 shows the amino acid sequence (SEQ ID N0:41) derived from the codmg sequence of SEQ 

ID NO:40 shown in Figure 26. 

Figure 28 shows a nucleotide s^ence designated herein as DNA38190 (SEQ ID NO:42). 
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Figure 29 shows a nucleoiide sequence (SEQ ID NO:46) of a native sequence PROllSl (UNQ581) 
cDNA. wherein SEQ ID NO:46 is a clone designated herein as •DNA44694-1500". 

Figure 30 shows the amino acid sequence (SEQ ID NO:47) derived from the coding sequence of SEQ 
ID NO:46 shown in Figure 29. 

Figure 31 shows a nucleotide sequence (SEQ ID N0:5l) of a native sequence PR01282 (UNQ652) 
5 cDNA, wherein SEQ ID NO:51 is a clone designated herein as "DNA45495-1550". 

Figure 32 shows the amino acid sequence (SEQ ID NO:52) derived from the coding sequence of SEQ 
ID NO:51 shown in Figure 31. 

Figure 33 shows a nucleotide sequence (SEQ ID NO:56) of a native sequence PR0358 cDNA, wherein 
SEQ ID NO:56 is a clone designated herein as "DNA4736M154. 
10 Figure 34 shows the amino acid sequence (SEQ ID NO:57) derived from the coding sequence of SEQ 

ID NO:56 shown in Figure 33. 

Figures 35A-B show a nucleotide sequence (SEQ ID N0:61) of a native sequence PRO1310 cDNA. 
wherein SEQ ID NO:6! is a clone designated herein as "DNA47394-1572. 

Figure 36 shows the amino acid sequence (SEQ ID NO:62) derived from the coding sequence of SEQ 
15 ID NO:61 shown in Figures 35A-B. 

Figure 37 shows a nucleotide sequence (SEQ ID NO:66) of a native sequence PR0698 (UNQ362) 
cDNA, wherein SEQ ID NO:66 is a clone designated herein as ''DNA48320-1433'*. 

Figure 38 shows the amino acid sequence (SEQ ID NO:67) derived from the coding sequence of SEQ 
ID NO:66 shown in Figure 37. 
20 Figure 39 shows a nucleotide sequence designated herein as DNA39906 (SEQ ID NO:68). 

Figure 40 shows a nucleotide sequence (SEQ ID NO:72) of a native sequence PR0732 (UNQ396) 
cDNA, wherein SEQ ID NO:72 is a clone designated herein as "DNA48334-1435". 

Figure 41 shows the amino acid sequence (SEQ ID NO:73) derived from the coding sequence of SEQ 
ID NO:72 shown in Figure 40. 
25 Figure 42 shows a nucleotide sequence designated herein as DNA20239 (SEQ ID NO:74). 

Figure 43 shows a nucleotide sequence designated herein as DNA38050 (SEQ ID N0:7S). 

Figure 44 shows a nucleotide sequence designated herein as DNA40683 (SEQ ID NO:76). 

Figure 45 shows a nucleotide sequence designated herein as DNA42580 (SEQ ID NO:77). 

Figures 46A-B show a nucleotide sequence (SEQ ID NO:83) of a native sequence PROl 120 (UNQ559) 
30 cDNA, wherein SEQ ID NO:83 is a clone designated herein as "DNA48606-1479", 

Figure 47 shows the amino acid sequence (SEQ ID NO:84) derived from the coding sequence of SEQ 
ID NO:83 shown in Figures 46A-B. 

Figure 48 shows a nucleotide sequence (SEQ ID NO:94) of a native sequence PR0537 (UNQ338) 
cDNA, wherein SEQ ID NO:94 is a clone designated herein as "DNA49141-143r . 
35 Figure 49 shows the amino acid sequence (SEQ ID NO:95) derived from the coding sequence of SEQ 

ID NO:94 shown in Figure 48. 

Figure 50 shows a nucleotide sequence (SEQ ID NO:96) of a native sequence PR0536 (UNQ337) 
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cDNA, wherein SEQ ID NO:96 is a clone designated herein as "DNA49 142- 1430*. 

Figure 51 shows the amino acid sequence (SEQ ID NO:97) derived from the coding sequence of SEQ 
ID NO:96 shown in Figure 50. 

Figure 52 shows a nucleotide sequence (SEQ ID NO:98) of a native sequence PR0535 (UNQ336) 
cDNA, wherein SEQ ID NO:98 is a clone designated herein as "DNA49 143- 1429". 
5 Figure 53 shows the amino acid sequence (SEQ ID NO:99) derived from die coding sequence of SEQ 

ID NO:98 shown in Figure 52. 

Figure 54 shows a nucleotide sequence designated herein as DNA30861 (SEQ ID NO: 100). 

Figure 55 shows a nucleotide sequence designated herein as DNA36351 (SEQ ID NO: 101). 

Figure 56 shows a nucleotide sequence (SEQ ID NO: 102) of a native sequence PR0718 (UNQ386) 
10 cDNA, wherein SEQ ID NO: 102 is a clone designated herein as "DNA49647-1398". 

Figure 57 shows the amino acid sequence (SEQ ID NO: 103) derived from the coding sequence of SEQ 
ID NO: 102 shown in Figure 56. 

Figure 58 shows a nucleotide sequence designated herein as DNA15386 (SEQ ID NO: 104). 

Figure 59 shows a nucleotide sequence designated herein as DN A 16630 (SEQ ID NO: 105). 
15 Figure 60 shows a nucleotide sequence designated herein as DNA 16829 (SEQ ID NO: 106). 

Figure 61 shows a nucleotide sequence designated herein as DNA28357 (SEQ ID NO: 107). 

Figure 62 shows a nucleotide sequence designated herein as DNA435I2 (SEQ ID NO: 108). 

Figure 63 shows a nucleotide sequence (SEQ ID NO: 112) of a native sequence PR0872 (UNQ439) 
cDNA, wherein SEQ ID N0:1 12 is a clone designated herein as "DNA49819-1439". 
20 Figure 64 shows the amino acid sequence (SEQ ID NO: 1 13) derived from the coding sequence of SEQ 

ID NO: II 2 shown in Figure 63. 

Figure 65 shows a nucleotide sequence (SEQ ID NO: 1 14) of a native sequence PRO1063 (UNQi28) 
cDNA, wherein SEQ ID NO: 1 14 is a clone designated herein as "DNA49820-1427\ 

Figure 66 shows the amino acid sequence (SEQ ID NO: 1 15) derived from the coding sequence of SEQ 
25 ID NO: 1 14 shown in Figure 65. 

Figure 67 shows a nucleotide sequence (SEQ ID NO: 1 16) of a native sequence PR0619 (UNQ355) 
cDNA, wherein SEQ ID N0:1 16 is a clone designated herein as "DNA49821-1562*. 

Figure 68 shows the amino acid sequence (SEQ ID NO: 1 17) derived from the coding sequence of SEQ 
ID NO: 1 16 shown in Figure 67. 
30 Figure 69 shows a nucleotide sequence (SEQ ID NO: 1 18) of a native sequence PR0943 (UN(^80) 

cDNA, wherein SEQ ID NO:l 18 is a clone designated herein as "DNA52192-1369\ 

Figure 70 shows the amino acid sequence (SEQ ID NO: 1 19) derived from the coding sequence of SEQ 
ID NO:l 18 shown in Figure 69. 

Figure 71 shows a nucleotide sequence (SEQ ID NO:123) of a native sequence PR01188 (UNQ602) 
35 cDNA, wherein SEQ ID NO: 123 is a clone designated herein as "DNA52598-1518". 

Figure 72 shows the amino acid sequence (SEQ ID NO: 124) derived from the coding sequence of SEQ 
ID NO: 123 shown in Figure 71 . 
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Figure 73 shows a nucleotide sequence (SEQ ID NO: 128) of a native sequence PR01133 (UNQ571) 
cDNA. wherein SEQ ID NO:128 is a clone designated herein as ''DNA53913-1490". 

Figure 74 shows the amino acid sequence (SEQ ID NO: 129) derived from the coding sequence of SEQ 
ID NO: 128 shown in Figure 73. 

Figure 75 shows a nucleotide sequence (SEQ ID NO: 134) of a native sequence PR0784 (UN(3459) 
5 cDNA, wherein SEQ ID NO: 134 is a clone designated herein as "DNA53978-1443". 

Figure 76 shows the amino acid sequence (SEQ ID NO: 135) derived from the coding sequence of SEQ 
ID NO: 134 shown in Figure 75. 

Figure 77 shows a nucleotide sequence designated herein as DNA44661 (SEQ ID NO: 136). 

Figure 78 shows a nucleotide sequence (SEQ ID NO: 137) of a native sequence PR0783 (UN(J458) 
10 cDNA, wherein SEQ ID NO: 137 is a clone designated herein as "DNA53996-1442". 

Figure 79 shows the amino acid sequence (SEQ ID NO: 138) derived from the coding sequence of SEQ 
ID NO: 137 shown in Figure 78. 

Figure 80 shows a nucleotide sequence designated herein as DNA45201 (SEQ ID NO:139). 

Figure 81 shows a nucleotide sequence designated herein as DNA14575 (SEQ ID NO:140). 
15 Figure 82 shows a nucleotide sequence (SEQ ID NO: 145) of a native sequence PRO820 (UNQ503) 

cDNA, wherein SEQ ID NO: 145 is a clone designated herein as ''DNA5604 1-1416**. 

Figure 83 shows the amino acid sequence (SEQ ID NO: 146) derived from the coding sequence of SEQ 
ID NO: 145 shown in Figure 82. 

Figure 84 shows a nucleotide sequence (SEQ ID NO: 147) of a native sequence PRO1080 (UNQ537) 
20 cDNA, wherein SEQ ID NO: 147 is a clone designated herein as "DN A56047. 1456" . 

Figure 85 shows the amino acid sequence (SEQ ID NO: 148) derived from the coding sequence of SEQ 
ID NO: 147 shown in Figure 84. 

Figure 86 shows a nucleotide sequence designated herein as DNA36527 (SEQ ID NO: 149). 

Figure 87 shows a nucleotide sequence (SEQ ID NO: 150) of a native sequence PRO1079 (UNQ536) 
25 cDNA, wherein SEQ ID NO: 150 is a clone designated herem as "DNA56050-I455". 

Figure 88 shows the amino acid sequence (SEQ ID NO: 151) derived from the coding sequence of SEQ 
ID NO: 150 shown in Figure 87. 

Figure 89 shows a nucleotide sequence (SEQ ID NO: 152) of a native sequence PR0793 (UNCJ432) 
cDNA, wherein SEQ ID NO: 152 is a clone designated herein as "DNA561 10-1437". 
30 Figure 90 shows the amino acid sequence (SEQ ID NO: 1 53) derived from the coding sequence of SEQ 

ID NO: 152 shown in Figure 89. 

Figure 91 shows a nucleotide sequence designated herein as DNA50177 (SEQ ID NO: 154). 

Figure 92 shows a nucleotide sequence (SEQ ID NO: 155) of a native sequence PR01016 (UN(}499) 
cDNA, wherein SEQ ID NO:155 is a clone designated herein as "DNA561 13-1378". 
35 Figure 93 shows the amino acid sequence (SEQ ID NO: 156) derived from the coding sequence of SEQ 

ID NO: 155 shown in Figure 92. 
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Figure 94 shows a nucleotide sequence (SEQ ID NO: 157) of a native sequence PR01013 (UNQ496) 
cDNA. wherein SEQ ID NO: 157 is a clone designated herein as "DNA56410-1414*. 

Figure 95 shows the amino acid sequence (SEQ ID NO: 158) derived from the coding sequence of SEQ 
ID NO: 157 shown in Figure 94. 

Figure 96 shows a nucleotide sequence (SEQ ID NO: 159) of a native sequence PR0937 (UNQ474) 
5 cDNA, wherein SEQ ID NO: 159 is a clone designated herein as "DNA56436-1448". 

Figure 97 shows the amino acid sequence (SEQ ID NO: 160) derived from the coding sequence of SEQ 
ID NO: 159 shown in Figure 96. 

Figure 98 shows a micleoiide sequence (SEQ ID NO: 164) of a native sequence PR0842 (UNQ473) 
cDNA, wherein SEQ ID NO: 164 is a clone designated herein as "DNA56855-1447". 
10 Figure 99 shows the amino acid sequence (SEQ ID NO: 165) derived from the coding sequence of SEQ 

ID NO: 164 shown in Figure 98. 

Figure 100 shows a nucleotide sequence (SEQ ID NO: 166) of a native sequence PR0839 (UN(2472) 
cDNA, wherein SEQ ID NO: 166 is a clone designated herein as ''DNA56859-1445". 

Figiu-e 101 shows the amino acid sequence (SEQ ID NO: 167) derived from the coding sequence of SEQ 
15 ID NO: 1 66 shown in Figure 100. 

Figure 102 shows a nucleotide sequence (SEQ ID NO: 168) of a native sequence PROl 180 (UNQ594) 
cDNA, wherein SEQ ID NO: 168 is a clone designated herein as "DNA56860-1510". 

Figure 103 shows the amino acid sequence (SEQ ID NO: 169) derived from the coding sequence of SEQ 
ID NO: 168 shown in Figure 102. 
20 Figure 104 shows a nucleotide sequence (SEQ ID NO: 170) of a native sequence PR01134 (UNQ572) 

cDNA, wherein SEQ ID NO:170 is a clone designated herein as "DNA56865-149r. 

Figure 105 shows the amino acid sequence (SEQ ID NO: 1 7 1 ) derived from the coding sequence of SEQ 
ID NO: 170 shown in Figure 104. 

Figure 106 shows a nucleotide sequence designated herein as DNA52352 (SEQ ID NO: 172). 
25 Figure 107 shows a nucleotide sequence designated herein as DNA55725 (SEQ ID NO: 173). 

Figure 108 shows a nucleotide sequence (SEQ ID NO: 174) of a native sequence PRO830 (UN()470) 
cDNA, wherein SEQ ID NO: 174 is a clone designated herein as ''DNA56866-1342'. 

Figure 109 shows the amino acid sequence (SEQ ID NO: 175) derived from the coding sequence of SEQ 
ID NO: 174 shown in Figure 108. 
30 Figure 1 10 shows a nucleotide sequence (SEQ ID NO: 176) of a native sequence PROl 1 15 (UNQ558) 

cDNA, wherein SEQ ID NO: 176 is a clone designated herein as ''DNA56868-1478V 

Figure 1 1 1 shows the amino acid sequence (SEQ ID NO: 1 77) derived from the coding sequence of SEQ 
ID NO: 176 shown in Figure 1 10. 

Figure 1 12 shows a nucleotide sequence (SEQ ID NO: 178) of a native sequence PR01277 (UNQ647) 
35 cDNA. wherein SEQ ID NO:178 is a clone designated herein as •'DNA56869-1545". 

Figure 1 1 3 shows the amino acid sequence (SEQ ID NO: 1 79) derived from the coding sequence of SEQ 
ID NO: 178 shown in Figure 1 12. 
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Figure 1 14 shows a nucleotide sequence (SEQ ID NO: 180) of a native sequence PROl 135 (UNQS73) 
cDNA, wherein SEQ ID NO: 180 is a clone designated herein as ''DNA56870-1492'. 

Figure 1 1 5 shows the amino acid sequence (SEQ ID NO: 181) derived from the coding sequence of SEQ 
ID NO: 180 shown in Figure 114. 

Figure 1 16 shows a nucleotide sequence (SEQ ID NO: 182) of a native sequence PROl 1 14 (UNQ557) 
5 cDNA, wherein SEQ ID NO: 182 is a clone designated herein as •'DNA57033-1403". 

Figure 1 17 shows the amino acid sequence (SEQ ID NO: 183) derived from the coding sequence of SEQ 
ID NO: 182 shown in Figure 116. 

Figure 1 18 shows a nucleotide sequence designated herein as DNA48466 (SEQ ID NO: 184). 

Figure 1 19 shows a nucleotide sequence (SEQ ID NO: 188) of a native sequence PR0828 (UNQ469) 
10 cDNA, wherein SEQ ID NO: 188 is a clone designated herein as "DNAS7037-1444*. 

Figure 120 shows the amino acid sequence (SEQ ID NO: 189) derived from the coding sequence of SEQ 
ID NO: 188 shown in Figure 1 19. 

Figure 121 shows a nucleotide sequence (SEQ ID NO: 193) of a native sequence PRO1009 (UN(J493) 
cDNA, wherein SEQ ID NO:193 is a clone designated herein as "DNA57129-I413". 
15 Figure 122 shows the amino acid sequence (SEQ ID NO: 194) derived from the coding sequence of SEQ 

ID NO:193 shown in Figure 121. 

Figure 123 shows a nucleotide sequence designated herein as DNA50853 (SEQ ID NO: 195). 

Figure 124 shows a nucleotide sequence (SEQ ID NO: 196) of a native sequence PRO1007 (UN(3491) 
cDNA, wherein SEQ ID NO:196 is a clone designated herein as *'DNA57690-1374". 
20 Figure 125 shows the amino acid sequence (SEQ ID NO: 197) derived from the coding sequence of SEQ 

ID NO: 196 shown in Figure 124. 

Figure 126 shows a nucleotide sequence (SEQ ID NO: 198) of a native sequence PRO1056 (UNQ521) 
cDNA, wherein SEQ ID NO:198 is a clone designated herein as "DNA57693-1424". 

Figure 127 shows the amino acid sequence (SEQ ID NO: 199) derived from the coding sequence of SEQ 
25 ID NO: 198 shown in Figure 126. 

Figure 128 shows a nucleotide sequence (SEQ ID NO:200) of a native sequence PR0826 (UN(3467) 
cDNA, wherein SEQ ID NO:200 is a clone designated herein as "DNA57694-134r. 

Figure 129 shows the amino acid sequence (SEQ ID NO:201) derived from the coding sequence of SEQ 
ID NO:200 shown in Figure 128. 
30 Figure 130 shows a nucleotide sequence (SEQ ID NO:202) of a native sequence PR0819 (UN(3466) 

cDNA, wherein SEQ ID NO:202 is a clone designated herein as •DNA57695.1340". 

Figure 1 3 1 shows the amino acid sequence (SEQ ID NO:203) derived from the coding sequence of SEQ 
ID NO:202 shown in Figure 130. 

Figure 132 shows a nucleotide sequence (SEQ ID NO:204) of a native sequence PRO1006 (UN(3490) 
35 cDNA, wherein SEQ ID NO:204 is a clone designated herein as "DNA57699-1412V 

Figure 133 shows the amino acid sequence (SEQ ID NO:205) derived from the coding sequence of SEQ 
ID NO:204 shown in Figure 132. 
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Figure 134 shows a nucleotide sequence (SEQ ID NO:206) of a native sequence PROl 1 12 (UNQ555) 
cDNA, wherein SEQ ID NO:206 is a clone designated herein as "DNA57702-1476". 

Figure 1 35 shows the amino acid sequence (SEQ ID NO:207) derived from the coding sequence of SEQ 
ID NO:206 shown in Figure 134. 

Figure 136 shows a nucleotide sequence (SEQ ID NO:208) of a native sequence PRO1074 (UNQ531) 
5 cDNA, wherein SEQ ID NO:208 is a clone designated herein as "DNA57704-1452". 

Figure 137 shows the amino acid sequence (SEQ ID NO:209) derived from the coding sequence of SEQ 
ID NO:208 shown in Figure 136. 

Figure 138 shows a nucleotide sequence (SEQ ID NO:210) of a native sequence PR01(X)5 (UN(3489) 
cDNA, wherein SEQ ID NO:210 is a clone designated herein as ■DNA57708-1(X)5". 
10 Figure 139 shows the ammo acid sequence (SEQ ID N0:2 1 1) derived from the coding sequence of SEQ 

ID NO:210 shown in Figure 138. 

Figure 140 shows a nucleotide sequence (SEQ ID NO:212) of a native sequence PRO1073 (UNQ530) 
cDNA, wherein SEQ ID NO:212 is a clone designated herein as "DNA57710-145r . 

Figure 141 shows the amino acid sequence (SEQ ID NO:213) derived ftom the coding sequence of SEQ 
IS ID NO:212 shown m Figure 140: 

Figure 142 shows a nucleotide sequence designated herein as DNA5S938 (SEQ ID NO:214). 

Figure 143 shows a nucleotide sequence (SEQ ID NO:215) of a native sequence PROl 152 (UNQ582) 
cDNA. wherein SEQ ID NO:215 is a clone designated herein as "DNA5771 1-1501". 

Figure 144 shows the amino acid sequence (SEQ ID NO :2 16) derived from the coding sequence of SEQ 
20 ID NO:215 shown in Figure 143. 

Figure 145 shows a nucleotide sequence designated herein as DNA55807 (SEQ ID NO:2I7). 

Figure 146 shows a nucleotide sequence (SEQ ID N0:218) of a native sequence PROl 136 (UNQ574) 
cDNA, wherein SEQ ID NO:218 is a clone designated herein as "DNA57827-1493". 

Figure 147 shows the amino acid sequence (SEQ ID NO:219) derived from the coding sequence of SEQ 
25 ID NO:218 shown m Figure 146. 

Figure 148 shows a nucleotide sequence (SEQ ID NO:220) of a native sequence PR0813 (UNQ465) 
cDNA, wherem SEQ ID NO:220 is a clone designated herein as "DNA57834-1339". 

Figure 149 shows the amino acid sequence (SEQ ID NO:221) derived from the coding sequence of SEQ 
ID NO:220 shown in Figure 148. 
30 Figure 150 shows a nucleotide sequence (SEQ ID NO:222) of a native sequence PRO809 (UN(^64) 

cDNA, wherein SEQ ID NO:222 is a clone designated herein as •DNA57836-1338". 

Figure 151 shows the amino acid sequence (SEQ ID NO:223) derived from the coding sequence of SEQ 
ID NO:222 shown in Figure 150. 

Figure 152 shows a nucleotide sequence (SEQ ID NO:224) of a native sequence PR0791 (UN(^63) 
35 cDNA, wherein SEQ ID NO:224 is a clone designated herein as •DNA57838-1337". 

Figure 153 shows the amino acid sequence (SEQ ID NO:225) derived from the coding sequence of SEQ 
ID NO:224 shown in Figure 152. 
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Figure 154 shows a nucleotide sequence (SEQ ID NO:226) of a native sequence PRO1004 (UNQ488) 
cDNA, wherein SEQ ID NO:226 is a clone designated herein as "DNA57844'1410". 

Figure 155 shows the amino acid sequence (SEQ ID NO:227) derived from the coding sequence of SEQ 
ID NO:226 shown in Figure 154. 

Figure 156 shows a nucleotide sequence (SEQ ID NO:228) of a native sequence PROl 1 1 1 (UNQ554) 
5 cDNA, wherein SEQ ID NO:228 is a clone designated herein as "DNA5872 1-1475". 

Figure 157 shows the amino acid sequence (SEQ ID NO:229) derived from the coding sequence of SEQ 
ID NO:228 shown in Figure 156. 

Figure 158 shows a nucleotide sequence (SEQ ID NO:230) of a native sequence PROl 344 (UNQ699) 
cDNA» wherein SEQ ID NO:230 is a clone designated herein as ''DNA58723-1588'. 
10 Figure 159 shows the amino acid sequence (SEQ ID NO: 23 1) derived from the coding sequence of SEQ 

ID NO:230 shown in Figure 158. 

Figure 160 shows a nucleotide sequence (SEQ ID NO:235) of a native sequence PROl 109 (UNQ552) 
cDNA« wherein SEQ ID NO:235 is a clone designated herein as "DNAS8737-1473**. 

Figure 161 shows the amino acid sequence (SEQ ID NO:236) derived from the coding sequence of SEQ 
15 ID NO:235 shown in Figure 160. 

Figure 162 shows a nucleotide sequence (SEQ ID NO:240) of a native sequence PR01383 (UNQ719) 
cDNA, wherein SEQ ID NO:240 is a clone designated herein as "DNA58743-1609". 

Figure 163 shows the amino acid sequence (SEQ ID NO: 24 1) derived from the coding sequence of SEQ 
ID NO:240 shown in Figure 162. 
20 Figure 164 shows a nucleotide sequence (SEQ ID NO:245) of a native sequence PRO1003 (UN(J487) 

cDNA, wherein SEQ ID NO:245 is a clone designated herein as "DNA58846-1409". 

Figure 165 shows the amino acid sequence (SEQ ID NO:246) derived from the coding sequence of SEQ 
ID NO:245 shown in Figure 164. 

Figure 166 shows a nucleotide sequence (SEQ ID NO:247) of a native sequence PROl 108 (UNQ551) 
25 cDNA, wherein SEQ ID NO:247 is a clone designated herein as "DNA58848-1472". 

Figure 167 shows the amino acid sequence (SEQ ID NO:248) derived from the coding sequence of SEQ 
ID NO:247 shown in Figure 166. 

Figure 168 shows a nucleotide sequence (SEQ ID NO:249) of a native sequence PROl 137 (UNQ575) 
cDNA, wherein SEQ ID NO:249 is a clone designated herein as "DNA58849-1494". 
30 Figure 169 shows the amino acid sequence (SEQ ID NO:250) derived from the coding sequence of SEQ 

ID NO:249 shown in Figure 168. 

Figure 170 shows a nucleotide sequence (SEQ ID NO:252) of a native sequence PROl 138 (UNQ576) 
cDNA, wherein SEQ ID NO:252 is a clone designated herein as "DNA58850-1495". 

Figure 171 shows the amino acid sequence (SEQ ID NO:253) derived from the coding sequence of SEQ 
35 ID NO:252 shown in Figure 170. 

Figure 172 shows a nucleotide sequence designated herein as DNA49140 (SEQ ID NO:254). 
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Figure 173 shows a micleotide sequence (SEQ ID NO:255) of a native sequence PROI054 (UNQ519) 
cDNA, wherein SEQ ID NO:255 is a clone designated herein as "DNA58853-1423". 

Figure 174 shows the amino acid sequence (SEQ ID NO:256) derived from the coding sequence of SEQ 
ID NO:255 shown in Figure 173. 

Figure 175 shows a nucleotide sequence (SEQ ID NO:257) of a native sequence PR0994 (UNQ518) 
5 cDNA, wherein SEQ ID NO:257 is a clone designated herein as "DNA58855-1422". 

Figure 176 shows the amino acid sequence (SEQ ID NO:258) derived from the coding sequence of SEQ 
ID NO:257 shown in Figure 175. 

Figure 177 shows a nucleotide sequence (SEQ ID NO:259) of a native sequence PR0812 (UNQ517) 
cDNA. wherein SEQ ID NO:259 is a clone designated herein as •DNA59205-142r . 
10 Figure 178 shows the amino acid sequence (SEQ ID NO:260) derived from the coding sequence of SEQ 

ID NO:259 shown in Figure 177. 

Figure 179 shows a nucleotide sequence (SEQ ID NO:261) of a native sequence PRO1069 (UNQ526) 
cDNA, wherein SEQ ID NO:261 is a clone designated herein as "DNA5921 1-1450". 

Figure 180 shows the amino acid sequence (SEQ ID NO:262) derived from the codmg sequence of SEQ 
15 ID NO:261 shown in Figure 179. 

Figure 181 shows a nucleotide sequence (SEQ ID NO:263) of a native sequence PROl 129 (UNQ568) 
cDNA, wherein SEQ ID NO:263 is a clone designated herein as •DNA59213-1487". 

Figure 1 82 shows the amino acid sequence (SEQ ID NO:264) derived from the coding sequence of SEQ 
ID NO:263 shown in Figure 181. 
20 Figure 183 shows a nucleotide sequence (SEQ ID NO:265) of a native sequence PRO 1068 (UNQ525) 

cDNA. wherein SEQ ID NO:265 is a clone designated herein as •DNA59214-1449\ 

Figure 184 shows the amino acid sequence (SEQ ID NO:266) derived from the coding sequence of SEQ 
ID NO:265 shown in Figure 183. 

Figure 185 shows a nucleotide sequence (SEQ ID NO:267) of a native sequence PRO1066 (UNQ524) 
25 cDNA, wherein SEQ ID NO:267 is a clone designated herein as ''DNA59215-1425". 

Figure 186 shows the amino acid sequence (SEQ ID NO:268) derived from the coding sequence of SEQ 
ID NO:267 shown m Figure 185. 

Figure 187 shows a nucleotide sequence (SEQ ID NO:269) of a native sequence PROl 184 (UNQ598) 
cDNA, wherein SEQ ID NO:269 is a clone designated herein as ''DNA59220-1514". 
30 Figure 188 shows the amino acid sequence (SEQ ID NO:270) derived from the coding sequence of SEQ 

ID NO:269 shown in Figure 187. 

Figure 189 shows a nucleotide sequence (SEQ ID NO:27I) of a native sequence PRO1360 (UNQ709) 
cDNA, wherein SEQ ID NO:271 is a clone designated herein as 'DNA59488-1603". 

Figure 190 shows the amino acid sequence (SEQ ID NO:272) derived from the coding sequence of SEQ 
35 ID NO:271 shown in Figure 189. 

Figure 191 shows a nucleotide sequence (SEQ ID NO:273) of a native sequence PRO 1029 (UNQ514) 
cDNA, wherein SEQ ID NO:273 is a clone designated herein as •DNA59493-1420*. 
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Figure 192 shows the amino acid sequence (SEQ ID NO:274) derived from the coding sequence of SEQ 
ID NO:273 shown in Figure 191. 

Figure 193 shows a nucleotide sequence (SEQ ID NO:275) of a native sequence PROl 139 (UNQ577) 
cDNA, wherein SEQ ID NO:275 is a clone designated herein as "DNA59497-1496". 

Figure 194 shows the amino acid sequence (SEQ ID NO:276) derived from the coding sequence of SEQ 
ID NO:275 shown in Figure 193. 

Figure 195 shows a nucleotide sequence (SEQ ID NO:277) of a native sequence PRO1309 (UNQ675) 
cDNA, wherein SEQ ID NO:277 is a clone designated herein as "DNA59588-157r. 

Figure 196 shows the amino acid sequence (SEQ ID NO:278) derived from the coding sequence of SEQ 
ID NO:277 shown in Figure 195. 

Figure 197 shows a nucleotide sequence (SEQ ID NO:280) of a native sequence PRO1028 (UNQ513) 
cDNA, wherein SEQ ID NO:280 is a clone designated herein as "DNA59603-1419". 

Figure 198 shows the amino acid sequence (SEQ ID NO:281) derived from the coding sequence of SEQ 
ID NO:280 shown in Figure 197. 

Figure 199 shows a nucleotide sequence (SEQ ID NO:282) of a native sequence PRO1027 (UNQ5i2) 
cDNA. wherein SEQ ID NO:282 is a clone designated herein as "DNA59605-1418". 

Figure 200 shows the amino acid sequence (SEQ ID NO:283) derived from the coding sequence of SEQ 
ID NO:282 shown in Figure 199. 

Figure 201 shows a nucleotide sequence (SEQ ID NO:284) of a native sequence PROl 107 (UNQ550) 
cDNA, wherein SEQ ID NO:284 is a clone designated herein as "DNA59606-147r . 

Figure 202 shows the amino acid sequence (SEQ ID NO:285) derived from the coding sequence of SEQ 
ID NO:284 shown in Figure 201. 

Figure 203 shows a nucleotide sequence (SEQ ID NO:286) of a native sequence PROl 140 (UNQ578) 
cDNA, wherein SEQ ID NO:286 is a clone designated herein as "DNA59607-1497\ 

Figure 204 shows the amino acid sequence (SEQ ID NO:287) derived from the coding sequence of SEQ 
ID NO:286 shown in Figure 203. 

Figure 205 shows a nucleotide sequence (SEQ ID NO:288) of a native sequence PROl 106 (UNQ549) 
cDNA. wherein SEQ ID NO:288 is a clone designated herein as "DNA59609-1470". 

Figure 206 shows the amino acid sequence (SEQ ID NO:289) derived from the coding sequence of SEQ 
ID NO:288 shown in Figure 205. 

Figure 207 shows a nucleotide sequence (SEQ ID NO:290) of a native sequence PRO 129 1 (UNQ659) 
cDNA. wherein SEQ ID NO:290 is a clone designated herein as "DNA59610-1556". 

Figure 208 shows the amino acid sequence (SEQ ID NO:291) derived from the coding sequence of SEQ 
ID NO:290 shown in Figure 207. 

Figure 209 shows a nucleotide sequence (SEQ ID NO:292) of a native sequence PROl 105 (UNQ548) 
cDNA, wherein SEQ ID NO:292 is a clone designated herein as "DNA59612-1466". 

Figure 210 shows the amino acid sequence (SEQ ID NO:293) derived from the coding sequence of SEQ 
ID NO:292 shown in Figure 209. 
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Figure 21 1 shows a nucleoilde sequence (SEQ ID NO:294) of a native sequence PR051 1 (UNQ51 i) 
cDNA, wherein SEQ ID NO:294 is a clone designated herein as *DNA59613-I417". 

Figure 212 shows die amino acid sequence (SEQ ID NO:295) derived from the coding sequence of SEQ 
ID NO:294 shown in Figure 211. 

Figure 213 shows a nucleotide sequence (SEQ ID NO:296) of a native sequence PROl 104 (UNQ547) 
5 cDNA, wherein SEQ ID NO:296 is a clone designated herein as "DNA59616-1465". 

Figure 214 shows the amino acid sequence (SEQ ID NO:297) derived from the coding sequence of SEQ 
ID NO:296 shown in Figure 213, 

Figure 215 shows a nucleotide sequence (SEQ ID NO:298) of a native sequence PROl 100 (UNQ546) 
cDNA, wherein SEQ ID NO:298 is a clone designated herein as "DNA59619-1464". 
10 Figure 216 shows the amino acid sequence (SEQ ID NO:299) derived from the coding sequence of SEQ 

ID NO:298 shown in Figure 215. 

Figure 217 shows a nucleotide sequence (SEQ ID NO:300) of a native sequence PR0836 (UNQ545) 
cDNA, wherein SEQ ID NO:300 is a clone designated herein as "DNA59620-1463". 

Figure 218 shows the amino acid sequence (SEQ ID NO:301) derived from the coding sequence of SEQ 
15 ID NO:300 shown in Figure 217. 

Figure 219 shows a nucleotide sequence (SEQ ID NO:302) of a native sequence PROl 141 (UNQ579) 
cDNA, vdierein SEQ ID NO:302 is a clone designated herein as *DNA59625-1498". 

Figure 220 shows the amino acid sequence (SEQ ID NO:303) derived from the coding sequence of SEQ 
ID NO:302 shown in Figure 219, 
20 Figure 221 shows a nucleotide sequence designated herein as DNA33128 (SEQ ID NO:304). 

Figure 222 shows a nucleotide sequence designated herein as DNA34256 (SEQ ID NO:305). 

Figure 223 shows a nucleotide sequence designated herein as DNA47941 (SEQ ID NO:306). 

Figure 224 shows a nucleotide sequence designated herein as DNA54389 (SEQ ID NO:307). 

Figure 225 shows a nucleotide sequence (SEQ ID NO:308) of a native sequence PROl 132 (UNQ570) 
25 cDNA, wherein SEQ ID NO:308 is a clone designated herein as "DNA59767-1489". 

Figure 226 shows the amino acid sequence (SEQ ID NO:309) derived from the coding sequence of SEQ 
ID NO:308 shown in Figure 225. 

Figure 227 shows a nucleotide sequence (SEQ ID NO:313) of a native sequence PR01346 cDNA, 
wherein SEQ ID NO:313 is a clone designated herein as "DNA59776-1600". 
30 Figure 228 ^ws the amino acid sequence (SEQ ID NO: 3 14) derived from the coding sequence of SEQ 

ID NO:313 shown in Figure 227. 

Figure 229 shows a nucleotide sequence (SEQ ID N0:3 18) of a native sequence PROl 131 (UNQ569) 
cDNA, wherein SEQ ID NO:318 is a clone designated herein as 'DNA59777*1480". 

Figure 230 shows the amino acid sequence (SEQ ID NO:319) derived from the coding sequence of SEQ 
35 ID NO:318 shown in Figure 229. 

Figure 231 shows a nucleotide sequence designated herein as DNA43546 (SEQ ID NO:320). 
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Figure 232 shows a nucleotide sequence (SEQ ID NO:325) of a native sequence PR01281 (UNQ651) 
cDNA, wherein SEQ ID NO:325 is a clone designated herein as "DNA59820-1549". 

Figure 233 shows the amino acid sequence (SEQ ID NO:326) derived from the coding sequence of SEQ 
ID NO:325 shown in Figure 232. 

Figure 234 shows a nucleotide sequence (SEQ ID NO:333) of a native sequence PRO 1064 (UNQl 1 1) 
5 cDNA. wherein SEQ ID NO:333 is a clone designated herein as "DNA59827-1426". 

Figure 235 shows the amino acid sequence (SEQ ID NO:334) derived from the coding sequence of SEQ 
ID NO:333 shown in Figure 234. 

Figure 236 shows a nucleotide sequence designated herein as DNA45288 (SEQ ID NO:335). 
Figure 237 shows a nucleotide sequence (SEQ ID NO:339) of a native sequence PR01379 (UNQ7i6) 
10 cDNA. wherein SEQ ID NO:339 is a clone designated herein as "DNA59828-1608". 

Figure 238 shows the amino acid sequence (SEQ ID NO:340) derived from die coding sequence of SEQ 
ID NO:339 shown in Figure 237. 

Figure 239 shows a nucleotide sequence (SEQ ID NO:344) of a native sequence PR0844 (UNQ544) 
cDNA, wherein SEQ ID N0:344 is a clone designated herein as "DNA59838-1462". 
1 5 Figure 240 shows die amino acid sequence (SEQ ID NO:345) derived from die coding sequence of SEQ 

ID NO:344 shown in Figure 239. 

Figure 241 shows a nucleotide sequence (SEQ ID NO:346) of a native sequence PR0848 (UNQ543) 
cDNA, wherein SEQ ID NO:346 is a clone designated herein as "DNA59839-146r . 

Figure 242 shows the amino acid sequence (SEQ ID NO:347) derived from die coding sequence of SEQ 
20 ID NO:346 shown in Figure 24 1 . 

Figure 243 shows a nucleotide sequence (SEQ ID NO:348) of a native sequence PRO1097 (UNQ542) 
cDNA, wherein SEQ ID NO:348 is a clone designated herein as "DNA5984 1-1460". 

Figure 244 shows die amino acid sequence (SEQ ID NO:349) derived from die coding sequence of SEQ 
ID NO:348 shown in Figure 243. 
25 Figure 245 shows a nucleotide sequence (SEQ ID NO:350) of a native sequence PROl 153 (UNQ583) 

cDNA, wherein SEQ ID NO:350 is a clone designated herein as ''DNA59842-1502". 

Figure 246 shows die amino acid sequence (SEQ ID NO:35 1 ) derived from die coding sequence of SEQ 
ID NO:350 shown in Figure 245. 

Figure 247 shows a nucleotide sequence (SEQ ID NO;352) of a native sequence PROl 154 (UNQ584) 
30 cDNA, wherein SEQ ID NO:352 is a clone designated herein as "DNA59846-1503". 

Figure 248 shows die amino acid sequence (SEQ ID NO:353) derived from die coding sequence of SEQ 
ID NO:352 shown in Figure 247. 

Figure 249 shows a nucleotide sequence (SEQ ID NO:354) of a native sequence PROl 181 (UNQ595) 
cDNA, wherein SEQ ID NO:354 is a clone designated herein as "DNA59847-15ir. 
35 Figure 250 shows die amino acid sequence (SEQ ID NO:355) derived from die coding sequence of SEQ 

ID NO:354 shown in Figure 249. 
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Figure 251 shows a nucleotide sequence (SEQ ID NO:356) of a native sequence PROl 182 (UNQS96) 
cDNA. wherein SEQ ID NO:356 is a clone designated herein as "DNA59848-1512-. 

Figure 252 shows the amino acid sequence (SEQ ID NO:357) derived from the coding sequence of SEQ 
ID NO:356 shown in Figure 251. 

Figure 253 shows a nucleotide sequence (SEQ ID NO:358) of a native sequence PROl 155 (UNQ585) 
5 cDNA, wherein SEQ ID NO:358 is a clone designated herein as "DNA59849-1504". 

Figure 254 shows the amino acid sequence (SEQ ID NO:359) derived from the coding sequence of SEQ 
ID NO:358 shown in Figure 253, 

Figure 255 shows a nucleotide sequence (SEQ ID NO:360) of a native sequence PROl 156 (UNQ586) 
cDNA, wherein SEQ ID NO:360 is a clone designated herein as "DNA59853-1505". 
10 Figure 256 shows the amino acid sequence (SEQ ID NO:361) derived from the coding sequence of SEQ 

ID NO:360 shown in Figure 255. 

Figure 257 shows a nucleotide sequence (SEQ ID NO:362) of a native sequence PRO1098 (UNQ541) 
cDNA, wherein SEQ ID NO:362 is a clone designated herein as "DNA59854-1459". 

Figure 258 shows the amino acid sequence (SEQ ID NO: 363) derived from the coding sequence of SEQ 
15 ID NO:362 shown in Figure 257. 

Figure 259 shows a nucleotide sequence (SEQ ID NO:364) of a native sequence PROl 127 (UNQ565) 
cDNA, wherein SEQ ID NO:364 is a clone designated herein as "DNA60283-1484". 

Figure 260 shows the amino acid sequence (SEQ ID NO:365) derived from the coding sequence of SEQ 
ID NO:364 shown in Figure 259. 
20 Figure 261 shows a nucleotide sequence (SEQ ID NO:366) of a native sequence PROl 126 (UNQ564) 

cDNA. wherein SEQ ID NO:366 is a clone designated herein as "DNA60615-1483". 

Figure 262 shows the amino acid sequence (SEQ ID NO:367) derived from the coding sequence of SEQ 
ID NO:366 shown in Figure 261. 

Figure 263 shows a nucleotide sequence (SEQ ID NO:368) of a native sequence PROl 125 (UNQ563) 
25 cDNA, wherein SEQ ID NO:368 is a clone designated herein as "DNA60619-1482". 

Figure 264 shows the amino acid sequence (SEQ ID NO:369) derived from the coding sequence of SEQ 
ID NO:368 shown in Figure 263. 

Figure 265 shows a nucleotide sequence (SEQ ID NO:370) of a native sequence PROl 186 (UNQ600) 
cDNA, wherein SEQ ID NO:370 is a clone designated herein as "DNA6062I-1516". 
30 Figure 266 shows the amino acid sequence (SEQ ID NO:37 1 ) derived from the coding sequence of SEQ 

ID NO:370 shown in Figure 265. 

Figure 267 shows a nucleotide sequence (SEQ ID NO:372) of a native sequence PROl 198 (VNQSl 1) 
cDNA, wherein SEQ ID NO:372 is a clone designated herein as •DNA60622-1525V 

Figure 268 shows the amino acid sequence (SEQ ID NO:373) derived from the coding sequence of SEQ 
35 ID NO:372 shown in Figure 267. 

Figure 269 shows a nucleotide sequence (SEQ ID NO:374) of a native sequence PROl 158 (UNQ588) 
cDNA. wherein SEQ ID NO:374 is a clone designated herein as •DNA60625-1507*. 
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Figure 270 shows the amino acid sequence (SEQ ID NO:375) derived from the coding sequence of SEQ 
ID NO:374 shown in Figure 269. 

Figure 271 shows a nucleotide sequence (SEQ ID NO:376) of a native sequence PROl 159 (UNQ589) 
cDNA, wherein SEQ ID NO:376 is a clone designated herein as •DNA60627«1508". 

Figure 272 shows the amino acid sequence (SEQ ID NO:377) derived from the coding sequence of SEQ 
5 ID NO:376 shown in Figure 271 . 

Figure 273 shows a nucleotide sequence (SEQ ID NO:378) of a native sequence PROl 124 (UNQ562) 
cDNA, wherein SEQ ID NO:378 is a clone designated herein as 'DNA60629-148r . 

Figure 274 shows the amino acid sequence (SEQ ID NO:379) derived from the coding sequence of SEQ 
ID NO:378 shown in Figure 273. 
10 Figure 275 shows a nucleotide sequence (SEQ ID NO:380) of a native sequence PR01287 (UNQ656) 

cDNA, wherein SEQ ID NO:380 is a clone designated herein as "DNA61755-1554". 

Figure 276 shows the amino acid sequence (SEQ ID NO:38 1 ) derived from the coding sequence of SEQ 
ID NO:380 shown in Figure 275. 

Figure 277 shows a nucleotide sequence (SEQ ID NO:386) of a native sequence PR01312 (UNQ678) 
15 cDNA. wherein SEQ ID NO:386 is a clone designated herein as "DNA6 1873- 1574". 

Figure 278 shows the amino acid sequence (SEQ ID NO:387) derived from the coding sequence of SEQ 
ID NO:386 shown in Figure 277. 

Figure 279 shows a nucleotide sequence (SEQ ID NO:388) of a native sequence PROl 192 (UNQ606) 
cDNA. wherein SEQ ID NO:388 is a clone designated herein as "DNA62814-152r . 
20 Figure 280 shows the amino acid sequence (SEQ ID NO:389) derived from the coding sequence of SEQ 

ID NO:388 shown in Figure 279. 

Figure 281 shows a nucleotide sequence (SEQ ID NO:393) of a native sequence PROl 160 (UNQ590) 
cDNA, wherein SEQ ID NO:393 is a clone designated herein as '■DNA62872-1509\ 

Figure 282 shows the amino acid sequence (SEQ ID NO:394) derived from the coding sequence of SEQ 
25 ID NO:393 shown in Figure 28 1 . 

Figure 283 shows a nucleotide sequence (SEQ ID NO:398) of a native sequence PROl 187 (UNQ601) 
cDNA, wherein SEQ ID NO:398 is a clone designated herein as "DNA62876-1517". 

Figure 284 shows the amino acid sequence (SEQ ID NO:399) derived from the coding sequence of SEQ 
ID NO:398 shown in Figure 283. 
30 Figure 285 shows a nucleotide sequence (SEQ ID NO:400) of a native sequence PROl 185 (UNQ599) 

cDNA, wherein SEQ ID NO:400 is a clone designated herein as "DNA6288M515". 

Figure 286 shows the amino acid sequence (SEQ ID NO:401) derived from the coding sequence of SEQ 
ID NO:400 shown in Figure 285. 

Figure 287 shows a nucleotide sequence (SEQ ID NO:402) of a native sequence PR01345 (UNQ700) 
35 cDNA, wherein SEQ ID NO:402 is a clone designated herein as "DNA64852-1589". 

Figure 288 shows the amino acid sequence (SEQ ID NO:403) derived from the coding sequence of SEQ 
ID NO:402 shown in Figure 287. 



298 



BNSOOCID: <WO_9963Q8aA2J.> 



wo 99/63088 



PCTAJS99/12252 



Figure 289 shows a nucleotide sequence (SEQ ID NO:407) of a native sequence PR01245 (UNQ629) 
cDNA, wherein SEQ ID NO:407 is a clone designated herein as "DNA64884-1527\ 

Figure 290 shows the amino acid sequence (SEQ ID NO:408) derived from the coding sequence of SEQ 
ID NO:407 shown in Figure 289. 

Figure 291 shows a nucleotide sequence (SEQ ID NO:409) of a native sequence PR01358 (UNQ707) 
5 cDNA, wherein SEQ ID NO:409 is a clone designated herein as "DNA64890-1612". 

Figure 292 shows die amino acid sequence (SEQ ID NO:410) derived from the coding sequence of SEQ 
ID NO:409 shown in Figure 291. 

Figure 293 shows a nucleotide sequence (SEQ ID N0:41 1) of a native sequence PROl 195 (UNQ608) 
cDNA, wherein SEQ ID NO:41 1 is a clone designated herein as "DNA65412-1523". 
10 Figure 294 shows the amino acid sequence (SEQ ID N0:4 12) derived from the coding sequence of SEQ 

ID N0:41 1 shown in Figure 293. 

Figure 295 shows a nucleotide sequence (SEQ ID NO:413) of a native sequence PRO1270 (UNQ640) 
cDNA, wherein SEQ ID NO:413 is a clone designated herein as •DNA66308-1537". 

Figure 296 shows the amino acid sequence (SEQ ID NO:414) derived from the coding sequence of SEQ 
15 ID NO:413 shown in Figure 295. 

Figure 297 shows a nucleotide sequence (SEQ ID NO:415) of a native sequence PR01271 (UNQ641) 
cDNA, wherein SEQ ID NO:415 is a clone designated herein as "DNA66309-1538". 

Figure 298 shows the amino acid sequence (SEQ ID N0:4 16) derived from the coding sequence of SEQ 
ID NO:415 shown in Figure 297. 
20 Figure 299 shows a nucleotide sequence (SEQ ID NO:417) of a native sequence PR01375 (UNQ712) 

cDNA, wherein SEQ ID NO:417 is a clone designated herein as -DNA67004-1614". 

Figure 300 shows the amino acid sequence (SEQ ID NO:418) derived from the coding sequence of SEQ 
ID NO:417 shown in Figure 299. 

Figure 301 shows a nucleotide sequence (SEQ ID NO:419) of a native sequence PR01385 (UNQ720) 
25 cDNA, wherein SEQ ID NO:419 is a clone designated herein as "DNA68869-1610". 

Figure 302 shows the amino acid sequence (SEQ ID NO:420) derived from the coding sequence of SEQ 
ID NO:419 shown m Figure 301. 

Figure 303 shows a nucleotide sequence (SEQ ID NO;421) of a native sequence PR01387 (UNQ722) 
cDNA, wherein SEQ ID NO:421 is a clone designated herein as "DNA68872-1620". 
30 Figure 304 shows the amino acid sequence (SEQ ID NO:422) derived from the coding sequence of SEQ 

ID NO:421 shown m Figure 303. 

Figure 305 shows a nucleotide sequence (SEQ ID NO:423) of a native sequence PR01384 (UNQ721) 
cDNA. wherein SEQ ID NO:423 is a clone designated herein as •DNA71 159-1617". 

Figure 306 shows the amino acid sequence (SEQ ID NO:424) derived from the coding sequence of SEQ 
35 ID NO:423 shown in Figure 305. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
I. Definitions 

The terms "PRO polypeptide" and *PRO" as used herein and when immediately followed by a 
numerical designation refer to various polyj^iides, wherein the complete designation (i.e. , PRO/number) refers 
to specific polypeptide sequences as described herein. The terms "PRO/nimiber polypeptide" and 
5 **PRO/number" wherein the terra "number" is provided an actual numerical designation as used herein 
encompass native sequence polypeptides and polypeptide variants (which are further defined herein). The PRO 
polypeptides described herein may be isolated from a variety of sources, such as from human tissue types or 
from another source, or prepared by recombinant or synthetic methods. 

A "native sequence PRO polypeptide" comprises a polypeptide having the same amino acid sequence 

10 as the corresponding PRO polypeptide derived from nature. Such native sequence PRO polypeptides can be 
isolated from nature or can be produced by recombinant or synthetic means. The term "native sequence PRO 
polypeptide" specifically encompasses naturally-occurring truncated or secreted forms of the specific PRO 
polypeptide (e.g., an extracellular domain sequence), namrally-occurring variant forms (e.g., alternatively 
spliced forms) and namrally-occurring allelic variants of the polypeptide. In various embodiments of the 

15 invention, the native sequence PR0281 polypeptide is a mature or fiilMength native sequence PR0281 
polypeptide comprising amino acids 1 to 345 of Figure 2 (SEQ ID NO:2), the native sequence PR0276 is a full- 
length or mature native sequence PR0276 comprising amino acids 1 through 251 of Figure 4 (SEQ ID NO:6). 
the native sequence PRO 189 is a full-length or mature native sequence PRO 189 comprising amino acids 1 
through 367 of Figure 6 (SEQ ID NO: 8), the native sequence PRO 190 polypeptide is a full-length or mature 

20 native sequence PRO190 polypeptide comprising amino acids 1 through 424 of Figure 9 (SEQ ID NO: 14), the 
native sequence PR0341 is a mature or full-length native sequence PR0341 comprising amino acids 1 to 458 
of Figure 12 (SEQ ID NO:20). the native sequence PRO180 is a full-length or manire native sequence PRO180 
comprising amino acids 1 through 266 of Figure 15 (SEQ ID NO:23), the native sequence PR0194 polypeptide 
is a mature or full-length native sequence PRO 194 polypeptide comprising amino acids 1 to 264 of Figure 18 

25 (SEQ ID NO:28), the native sequence PRO203 polypeptide is a mature or full-length native sequence PRO203 
polypeptide comprising amino acids 1 to 347 of Figure 20 (SEQ ID NO:30), the native sequence PRO290 is a 
full-length or mature native sequence PRO290 comprising amino acids 1 through 1003 of Figure 23 (SEQ ID 
NO:33), the native sequence PR0874 polypeptide comprises amino acids 1 to 321 of Figure 25 (SEQ ID 
NO:36), the native sequence PRO710 polypeptide is a mature or full-length native sequence PRO710polypeptide 

30 comprising amino acids 1 to 566 of Figure 27 (SEQ ID NO:41), the native sequence PR01151 is a mature or 
full-length native sequence PROl 151 comprising amino acids 1 to 259 of Figure 30 (SEQ ID NO:47), the native 
sequence PR01282 is a full-length or mature native sequence PR01282 comprising amino acids 1 or about 24 
through 673 of Figure 32 (SEQ ID NO:52), the native sequence PR0358 is a mature or full-length native 
sequence PR0358 polypeptide comprising amino acids 1 to 811 of Figure 34 (SEQ ID NO:57), the native 

35 sequence PRO1310 is a full-length or mature native sequence PRO 13 10 comprising amino acids 1 through 765 
of Figure 36 (SEQ ID NO:62), the native sequence PR0698 polypeptide is a mamre or full-length native 
sequence PR0698 polypeptide comprising amino acids 1 to 510 of Figure 38 (SEQ ID NO:67), the native 
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sequence PR0732 polypeptide is a mature or full-length native sequence PR0732 polypeptide comprising amino 
acids I to 453 of Figure 41 (SEQ ID NO:73). the native sequence PRO 1 120 is a full-length or mature native 
sequence PR01120 comprising amino acids 1 or about 18 to 867 of Figure 47 (SEQ ID NO:84), the native 
sequence PR0537 is a mature or lull-length native sequence PR0537 comprising amino acids 1 to 1 15 of Figure 
49 (SEQ ID NO:95). the native sequence PR0536 is a manire or full-length native sequence PR0536 comprising 
5 amino acids 1 to 313 of Figure 51 (SEQ ID NO:97), the native sequence PR0535 is a manire or full-length 
native sequence PR0535 comprising amino acids 1 to 201 of Figure 53 (SEQ ID NO:99), the native sequence 
PR0718 polypeptide is a mature or full-length native sequence PR0718 polypeptide comprising amino acids 1 
to 157 of Figure 57 (SEQ ID NO: 103), the native sequence PR0872 polypeptide is a mature or full-length native 
sequence PR0872 polypeptide comprising amino acids 1 to 610 of Figure 64 (SEQ ID NO: 113). the native 

10 sequence PRO1063 polypeptide is a mamre or fiill-length native sequence PRO1063 polypeptide comprising 
amino acids 1 to 301 of Figure 66 (SEQ ID N0:II5)« the native sequence PR06I9 is a full-length or mature 
native sequence PR0619 comprising amino acids I or about 21 through 123 of Figure 68 (SEQ ID NO: 117), 
the native sequence PR0943 is a mamre or full-length native sequence PR0943 comprising amino acids 1 to 504 
of Figure 70 (SEQ ID NO: 119), the native sequence PRO 1188 is a full-length or mature native sequence 

15 PROl 188 comprising amino acids 1 or about 22 to 1 184 of Figure 72 (SEQ ID NO: 124), the native sequence 
PROl 133 is a full-length or manire native sequence PROl 133 comprising amino acids 1 or about 19 through 
438 of Figure 74 (SEQ ID NO: 129), the native sequence PR0784 is a manire or full-length native sequence 
PR0784 comprising amino acids 16 to 228 of Figure 76 (SEQ ID NO: 135), the native sequence PR0783 
polypeptide is a mature or full-length native sequence PR0783 polypeptide comprising amino acids 1 to 489 of 

20 Figure 79 (SEQ ID NO: 138), the native sequence PRO820 is a full-length or manire native sequence PRO820 
comprising amino acids 1 or 16 through 124 of Figure 83 (SEQ ID NO: 146). the native sequence PRO1080 is 
a full-length or mature native sequence PRO1080 comprising amino acids 1 or 23 through 358 of Figure 85 (SEQ 
ID NO: 148), the native sequence PRO1079 is a fiill-lengih or mature native sequence PRO1079 comprising 
amino acids 1 or about 30 to 226 of Figure 88 (SEQ ID NO: 15 1), the native sequence PR0793 is a mature or 

25 full-length native sequence PR0793 comprising amino acids I to 138 of Figure 90 (SEQ ID NO: 153), the native 
sequence PRO 1016 is a full-length or mamre native sequence PRO1016 comprising amino acids 1 or 19 through 
378 of Figure 93 (SEQ ID NO: 156), the native sequence PRO1013 polypeptide is a fiill-length or manire native 
sequence PRO1013 polypeptide comprising amino acids 1 or 20 through 409 of Figure 95 (SEQ ID NO: 158). 
the native sequence PR0937 polypeptide is a manire or fiill-length native sequence PR0937 polypeptide 

30 comprising amino acids 1 to 556 of Figure 97 (SEQ ID NO: 160). the native sequence PR0842 is a fiill-length 
or manire native sequence PR0842 comprising amino acids 1 or about 23 to 1 19 of Figure 99 (SEQ ID 
NO: 165), the native sequence PR0839 is a full-length or mamre native sequence PR0839 comprising ammo 
acids 1 or about 24 to 87 of Figure 101 (SEQ ID NO: 167), the native sequence PROl 180 polypeptide is a mamre 
or fiiU-iength native sequence PROl 180 polypeptide comprising amino acids 1 to 277 of Figure 103 (SEQ ID 

35 NO: 169). the native sequence PROl 134 is a mamre or full-length native sequence PROl 134 comprising amino 
acids 1 to 371 of Figure 105 (SEQ ID NO: 171). the native sequence PRO830 is a mature or full-length native 
sequence PRO830 comprising amino acids 1 to 87 of Figure 109 (SEQ ID NO: 175). the native sequence 
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PRO! 1 15 Is a fuIMength or mature native sequence PROl 1 15 comprising amino acids 1 or about 21 to 445 of 
Figure 1 1 1 (SEQ ID NO: 177). the native sequence PR01277 is a fuIMength or manire native sequence PR01277 
comprising amino acids 1 or about 27 to 678 of Figure 1 13 (SEQ ID NO: 179), the native sequence PROl 135 
polypeptide is a mamre or full-length native sequence PROl 135 polypeptide comprising amino acids 1 to 541 
of Figure 1 15 (SEQ ID NO: 181), the native sequence PROl 1 14 interferon receptor is a mature or full-length 
5 native sequence PROl 114 interferon receptor comprising amino acids 1 to311 ofFigure 118 (SEQ ID NO: 184), 
the native sequence PR0828 polypeptide is a mamre or fuIMength native sequence PR0828 polypeptide 
comprising amino acids 1 to 187 ofFigure 120 (SEQ ID NO: 189). the native sequence PR01(X)9 is a fuIMength 
or manire native sequence PR01(X)9 comprising amino acids 1 or 23 to 615 of Figure 122 (SEQ ID NO: 194), 
the native sequence PRO1007 polypeptide is a full-length or manire native sequence PR0l(X)7 polypeptide 

10 comprising amino acids 1 or 31 through 346 of Figure 125 (SEQ ID NO: 197), the native sequence PRO1056 
is a mamre or fidMength native sequence PRO1056 comprising amino acids 1 to 120 of Figure 127 (SEQ ID 
NO: 199), the native sequence PR0826 is a manire or fuIMength native sequence PR0826 comprising amino 
acids 1 to 99 of Figure 129 (SEQ ID NO:201). the native sequence PR0819 is a manire or full-length native 
sequence PR0819 comprising amino acids 1 to 52 of Figure 131 (SEQ ID NO:203). the native sequence 

15 PRO1006 is a full-length or mamre native sequence PRO1006 comprising amino acids 1 or 24 through 392 of 
Figure 133 (SEQ ID NO:205). the native sequence PROl 112 polypeptide is a fuIMength or manire native 
sequence PROl 1 12 polypeptide comprising amino acids 1 or 14 through 262 of Figure 135 (SEQ ID NO:207), 
the native sequence PRO1074 polypeptide is a mature or fuIMength native sequence PRO1074 polypeptide 
comprising amino acids 1 to 331 of Figure 137 (SEQ ID NO:209), the native sequence PRO1005 is a fuIMength 

20 or mamre native sequence PRO1005 comprising amino acids 1 or about 21 to 185 of Figure 139 (SEQ ID 
N0:21 1), the native sequence PRO1073 is a fuIMength or mature native sequence PRO1073 comprising amino 
acids 1 or about 32 to 299 ofFigure 141 (SEQ ID NO:213). the native sequence PROl 152 is a mamre or full- 
length native sequence PROl 152 comprising amino acids 1 to 479 of Figure 144 (SEQ ID NO:216). the native 
sequence PRO 11 36 is a mamre or fuIMength native sequence PRO 11 36 comprising amino acids 1 to 632 of 

25 Figure 147 (SEQ ID NO:219), the native sequence PR0813 polypeptide is a mature or fuIMength native 
sequence PR0813 polypeptide comprising amino acids 1 to 76 of Figure 149 (SEQ ID NO:221), the native 
sequence PRO809 is a fuIMength or mamre native sequence PRO809 comprising amino acids 1 or 19 through 
265 of Figure 151 (SEQ ID NO:223), the native sequence PR0791 is a fuIMength or mature native sequence 
PR0791 comprising amino acids 1 or 26 through 246 of Figure 153 (SEQ ID NO:225), the native sequence 

30 PRO1004 is a fuIMength or mamre native sequence PRO1004 comprising amino acids 1 or about 25 through 
115 of Figure 155 (SEQ ID NO:227). the native sequence PROllll is a fuIMength or mature native sequence 
PROl 1 1 1 comprising amino acids 1 through 653 of Figure 157 (SEQ ID NO:229). the native sequence PROl 344 
is a mamre or full-length native sequence PRO 1344 comprising amino acids 1 to 720 of Figure 159 (SEQ ID 
NO:231), the native sequence PROl 109 is a mamre or fuIMength native sequence PROl 109 comprising amino 

35 acids 1 to 344 of Figure 161 (SEQ ID NO:236). the native sequence PR01383 is a mamre or fuIMength native 
sequence PR01383 comprising amino acids 1 to 423 of Figure 163 (SEQ ID NO:241), the native sequence 
PRO1003 polypeptide is a mamre or full-length native sequence PRO1003 polypeptide comprising amino acids 
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1 to 84 of Figure 165 (SEQ ID NO:246), the native sequence PRO1108 polypeptide is a mature or full-length 
native sequence PROl 108 polypeptide comprising amino acids 1 to 456 of Figure 167 (SEQ ID NO:248), the 
native sequence PROl 137 polypeptide is a mature or full-length native setjuence PRO 11 37 polypeptide 
comprising amino acids 1 to 240 of Figure 169 (SEQ ID NO:250), the native sequence PROl 138 polypeptide 
is a mature or full-length native sequence PROl 138 polypeptide comprising amino acids 1 to 335 of Figure 171 
5 (SEQ ID NO:253), the native sequence PRO1054 is a mature or full-length native sequence PRO1054 comprising 
amino acids 1 to 180 of Figure 174 (SEQ ID NO:256), the native sequence PR0994 is a manire or full-length 
native sequence PR0994 comprising amino acids 1 to 229 of Figure 176 (SEQ ID NO:258)» the native sequence 
PR0812 is a mature or full-length native sequence PR0812 comprising amino acids 1 to 83 of Figure 178 (SEQ 
ID NO:260). the native sequence PRO1069 polypeptide is a mature or full-length native sequence PRO1069 

10 polypeptide comprising amino acids 1 to 89 of Figure 180 (SEQ ID NO:262), the native sequence PROl 129 
polypeptide is a mature or fiilMength native sequence PROl 129 polypeptide con^>rising amino acids 1 to 524 
of Figure 182 (SEQ ID NO:264), the native sequence PRO1068 is a full-length or mature native sequence 
PRO1068 comprising amino acids 1 or about 21 to 124 of Figure 184 (SEQ ID NO:266), the native sequence 
PRO1066 polypeptide is a mature or fidl-length native sequence PRO 1066 polypeptide comprising amino acids 

15 1 to 1 17 of Figure 186 (SEQ ID NO:268), the native sequence PROl 184 polypeptide is a full-length or manire 
native sequence PROl 184 polypeptide comprising ammo acids 1 or 39 through 142 of Figure 188 (SEQ ID 
NO:270), the native sequence PRO1360 is a full-length or mature native sequence PRO 1360 comprising amino 
acids 1 or about 30 through 285 of Figure 190 (SEQ ID NO:272), the native sequence PRO1029 is a mature or 
full-length native sequence PRO1029 comprising amino acids 1 to 86 of Figure 192 (SEQ ID NO:274), the 

20 native sequence PROl 139 is a mature or full-length native sequence PROl 139 polypeptide comprising amino 
acids 1 to 131or 29-131 of Figure 194 (SEQ ID NO:276), the native sequence PRO1309 is a full-length or 
mature native sequence PRO1309 comprising amino acids I or about 35 through 522 of Figure 196 (SEQ ID 
NO:278), the native sequence PRO 1028 polypeptide is a fiill-length or mature native sequence PRO1028 
polypeptide comprising amino acids 1 or 20 through 197 of Figure 198 (SEQ ID NO:281), the native sequence 

25 PRO1027 is a fiill-length or mature native sequence PRO1027 comprising amino acids 1 or 34 through 77 of 
Figure 200 (SEQ ID NO:283), the native sequence PROl 107 polypeptide is a full-length or mamre native 
sequence PROl 107 polypeptide comprising amino acids 1 or 23 through 477 of Figure 202 (SEQ ID NO:285). 
the native sequence PRO 1140 polypeptide is a mature or fiill-length native sequence PROl 140 polypeptide 
conq>rising amino acids 1 to 255 of Figure 204 (SEQ ID NO:287), the native sequence PROl 106 polypeptide 

30 is a fiill-length or mature native sequence PROl 106 polypeptide comprising amino acids 1 or 17 through 469 
of Figure 206 (SEQ ID NO:289), the native sequence PR01291 is a mamre or fiill-length native sequence 
PR01291 comprising amino acids 1 to 282 of Figure 208 (SEQ ID NO:291), the native sequence PROl 105 
polypeptide is a fidl-length or manire native sequence PROl 105 polypeptide comprising amino acids 1 or 20 
through 180 of Figure 210 (SEQ ID NO:293). die native sequence PRO1026 is a fiill-length or mamre native 

35 sequence PRO 1026 comprising amino acids 1 or 26 through 237 of Figure 212 (SEQ ID NO:295). the native 
sequence PROl 104 is a fiill-length or mamre native sequence PROl 104 comprising amino acids 1 or about 23 
through 341 of Figure 214 (SEQ ID NO:297), the native sequence PROl 100 is a ftill-length or mature native 
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sequence PROl 100 comprising amino acids 1 or 21 through 320 of Figure 216 (SEQ ID NO:299), the native 
sequence PR0836 is a full-length or mature native sequence PR0836 comprising amino acids 1 or 30 through 
461 of Figure 218 (SEQ ID NO:301), the native sequence PROl 141 is a mature or full-length native sequence 
PROl 141 comprising amino acids 1 to 247 of Figure 220 (SEQ ID NO:303)» the native sequence PROl 132 is 
a fiill-lengih or mature native sequence PROl 132 comprising amino acids 1 or about 23 through 293 of Figure 
5 226 (SEQ ID NO:309), the native sequence NL7 is a mature or full-lengih native sequence NL7 comprising 
amino acids from about position 51 to about position 461 of Figure 228 (SEQ ID NO:314), the native sequence 
PROl 131 is a full-length or mature native sequence PROl 131 comprising amino acids 1 through 280 of Figure 
230 (SEQ ID NO:319), the native sequence PR01281 is a full-length or mature native sequence PR01281 
comprising amino acids 1 or about 16 to 775 of Figure 233 (SEQ ID NO:326), the native sequence PRO1064 

10 is a manire or full-length native sequence PRO 1064 comprising amino acids 1 to 153 of Figure 235 (SEQ ID 
NO:334), the native sequence PR01379 is a full-length or mature native sequence PR01379 comprising amino 
acids 1 or about 18 to 574 of Figure 238 (SEQ ID NO:340), the native sequence PR0844 is a full-length or 
mature native sequence PR0844 comprising amino acids 1 or 20 dirough 1 1 1 of Figure 240 (SEQ ID NO:344), 
the native sequence PR0848 is a full-length or mamre native sequence PR0848 comprising amino acids 1 or 36 

IS through 600 of Figure 242 (SEQ ID NO:347), the native sequence PRO1097 is a full-length or mamre native 
sequence PRO1097 comprising amino acids 1 or 21 through 91 of Figure 244 (SEQ ID NO:349), the native 
sequence PROl 153 is a mature or fiilMength native sequence PROl 153 comprising amino acids 1 to 197 of 
Figure 246 (SEQ ID NO:351), the native sequence PROl 154 is a full-length or mamre native sequence PROl 154 
comprising amino acids 1 or 35 to 941 of Figure 248 (SEQ ID NO:353), the native sequence PROl 181 is a 

20 mamre or full-length native sequence PROl 181 comprising amino acids 1 to 437 of Figure 250 (SEQ ID 
NO:355), the native sequence PROl 182 is a mamre or full-length native sequence PROl 182 comprising amino 
acids 1 to 271 of Figure 252 (SEQ ID NO:357), the native sequence PROl 155 is a full-length native or mature 
sequence PROl 155 comprising amino acids 1 or 19 through 135 of Figure 254 (SEQ ID NO:359), the native 
sequence PROl 156 is a full-length or mature native sequence PROl 156 comprising amino acids 1 or about 23 

25 to 159 of Figure 256 (SEQ ID NO:361), the native sequence PRO1098 is a full-length or mamre native sequence 
PRO1098 comprising amino acids 1 or 20 through 78 of Figure 258 (SEQ ID NO:363), the native sequence 
PROl 127 is a full-length or mamre native sequence PROl 127 comprising amino acids 1 or about 30 through 
67 of Figure 260 (SEQ ID NO:365), the native sequence PROl 126 is a mature or full-length native sequence 
PROl 126 comprising amino acids 1 to 402 of Figure 262 (SEQ ID NO:367), the native sequence PROl 125 is 

30 a mature or full-length native sequence PROl 125 comprising amino acids 26 to 447 of Figure 264 (SEQ ID 
NO:369), the native sequence PROl 186 is a full-length or mature native sequence PROl 186 comprising amino 
acids 1 or about 20 through 105 of Figure 266 (SEQ ID NO:37 1 ), the native sequence PROl 198 is a full-length 
or mature native sequence PROl 198 comprising amino acids 1 or about 35 to 229 of Figure 268 (SEQ ID 
NO:373), the native sequence PROl 158 is a full-length or mamre native sequence PROl 158 comprising amino 
»35 acids 1 or about 20 to 123 of Figure 270 (SEQ ID NO:375)» the native sequence PROl 159 is a mature or full- 
length native sequence PROl 159 comprising amino acids 1 to 90 of Figure 272 (SEQ ID NO: 377), the native 
sequence PROl 124 is a mature or full-length native sequence PROl 124 comprising amino acids 22 through 919 
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' of Figure 274 (SEQ ID NO:379), the native sequence PR01287 is a mature or fiiU-length native sequence 
PR01287 comprising amino acids 1 to 532 of Figure 276 (SEQ ID NO:381), the native sequence PR01312 is 
a full-length or mature native sequence PR01312 comprising amino acids 1 or about 15 to 212 of Figure 278 
(SEQ ID NO:387), the native sequence PRO! 192 is a full-length or mature native sequence PROl 192 comprising 
^ amino acids 1 or about 22 to 215 of Figure 280 (SEQ ID NO:389), the native sequence PROl 160 is a manire 
S or full-length native sequence PROl 160 comprising amino acids 1 to 90 of Figure 282 (SEQ ID NO:394), the 
native sequence PRO 1 1 87 is a full-length or mature native sequence PRO 1 1 87 comprising amino acids 1 or about 
18 through 120 of Figure 284 (SEQ ID NO:399), the native sequence PROl 185 is a full-length or mature native 
sequence PROl 185 comprising amino acids 1 or about 22 through 198 of Figure 286 (SEQ ID N0:401). the 
native sequence PRO 1345 is a manire or full-length native sequence PRO 1345 comprising amino acids 1 to 206 

10 of Figure 288 (SEQ ID NO:403), the native sequence PR01245 is a full-length or manire native sequence 
PR01245 comprising amino acids 1 or about 19 to 104 of Figure 290 (SEQ ID NO:408), the native sequence 
PR01358 is a full-length or mature native sequence PR01358 comprising amino acids 1 or about 19 through 
444 of Figure 292 (SEQ ID NO:4I0), the native sequence PROl 195 is a full-length or manire native sequence 
PROl 195 comprising amino acids 1 or about 23 through 151 of Figure 294 (SEQ ID NO:412), the native 

15 sequence PRO1270 is a manire or full-length native sequence PRO1270 comprising amino acids 1 to 313 of 
Figure 296 (SEQ ID NO:414), the native sequence PR01271 is a manire or full-length native sequence PR01271 
comprising amino acids 1 to 208 of Figure 298 (SEQ ID NO:416), the native sequence PR01375 is a fiilMength 
or mature native sequence PR01375 comprising amino acids 1 through 198 of Figure 300 (SEQ ID N0:418), 
the native sequence PR01385 is a manire or full-length native sequence PR01385 comprising amino acids 1 to 

20 128 of Figure302 (SEQ ID NO:420), the native sequence PRO 1387 is a mamre or full-length native sequence 
PR01387 comprising amino acids 1 to 394 of Figure 304 (SEQ ID NO:422) and the native sequence PR01384 
is a full-length or mature native sequence PR01384 comprising amino acids 1 to 229 of Figiffe 306 (SEQ ID 
NO:424). Start and stop codons are shown in bold font and underlined in the figures. 

The PRO polypeptide "extracellular domain" or "ECD** refers to a form of the PRO polypeptide which 

25 is essentially free of the transmembrane and cytoplasmic domains. Ordinarily, a PRO polypeptide ECD will have 
less than 1 % of such transmembrane and/or cytoplasmic domains and preferably, will have less than 0.5% of 
such domains. It will be understood that any transmembrane domains identified for the PRO polypeptides of 
the present invention are identified pursuam to criteria routinely employed in the art for identifying that type of 
hydrophobic domain. The exact boundaries of a transmembrane domain may vary but most likely by no more 

30 than about 5 amino acids at either end of the domain as iniually identified. Optionally, therefore, an extracellular 
domain of a PRO polypeptide may contain from about 5 or fewer amino acids on either or the transmembrane 
domain as initially identified. 

"PRO polypeptide variant" means an active PRO polypeptide as defmed above or below having at least 
about 80% amino acid sequence identity with a full-length native sequence PRO polypeptide sequence as 

35 disclosed herein, a full-length native sequence PRO polypeptide sequence lacking the signal peptide as disclosed 
herein, an extracellular domain of a PRO polypeptide as disclosed herein or any other fragment of a full-length 
PRO polypeptide sequence as disclosed herein. Such PRO polypeptide variants include, for instance, PRO 
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polypqitides wherein one or more amino acid residues are added, or deleted, at the N- or C-termmus of the ftill- 
length native amino acid sequence. Ordinarily, a PRO polypeptide variant will have at least about 80% amino 
acid sequence identity, preferably at least about 81 % amino acid sequence identity, more preferably at least about 
82% amino acid sequence identity, more preferably at least about 83% amino acid sequence identity, more 
pieferably at least about 84% amino acid sequence identity, more preferably at least about 85% amino acid 

5 sequence identity, more preferably at least about 86% amino acid sequence identity, more preferably at least 
about 87% amino acid sequence identity, more preferably at least about 88% amino acid sequence identity, more 
preferably at least about 89% amino acid sequence identity, more preferably at least about 90% amino acid 
sequence identity, more preferably at least about 91 % amino acid sequence identity, more preferably at least 
about 92% amino acid sequence identity, more preferably at least about 93% amino acid sequence identity, more 

10 preferably at least about 94% amino acid sequence identity, more preferably at least about 95% amino acid 
sequence identity, more preferably at least about 96% amino acid sequence identity, more preferably at least 
about 97% amino acid sequence identity, more preferably at least about 98% amino acid sequence identity and 
most preferably at least about 99% amino acid sequence identity with the amino acid sequence of the full-length 
native amino acid sequence as disclosed herein. Ordinarily. PRO variant polypeptides are at least about 10 

15 amino acids in length, often at least about 20 amino acids in length, more often at least about 30 amino acids in 
length, more often at least about 40 amino acids in length, more often at least about 50 amino acids in length, 
more often at least about 60 amino acids in length, more often at least about 70 amino acids in length, more often 
at least about 80 amino acids in length, more often at least about 90 amino acids in length, more often at least 
about 100 amino acids in length, more often at least about 150 amino acids in length, more often at least about 

20 200 amino acids in length, more often at least about 300 amino acids in length, or more. 

"Percent {%) amino acid sequence identity" with respect to the PRO polypeptide sequences identified 
herein is defmed as the percentage of amino acid residues in a candidate sequence that are identical with the 
amino acid residues in the specific PRO polypeptide sequence, after aligning the sequences and introducing gaps, 
if necessary, to achieve the maximum percent sequence identity, and not considering any conservative 

25 substinitions as part of the sequence identity. Alignment for purposes of determining percent amino acid 
sequence identity can be achieved in various ways that are within the skill in the an, for instance, using publicly 
available computer software such as BLAST. BLAST-2, AUGN or Megalign (DNASTAR) software. Those 
skilled in the art can determine appropriate parameters for measuring alignment, including any algorithms needed 
to achieve maximal alignment over the ftill length of the sequences being compared. For purposes herein, 

30 however, % amino acid sequence identity values are generated using the WU-BLAST-2 computer program 
(Altschul et al.. Methods in Enzvmoloev 266:460-480 (1996)). Most of the WU-BLAST-2 search parameters 
are set to the default values. Those not set to default values, i.e., the adjustable parameters, are set with the 
following values: overlap span = 1. overlap fraction = 0.125. word threshold (T) = 11. and scoring matrix 
= BLOSUM62. For purposes herein, a % amino acid sequence identity value is determined by dividing (a) the 

35 number of matching identical amino acid residues between the amino acid sequence of the PRO polypeptide of 
interest having a sequence derived from the native PRO polypeptide and the comparison amino acid sequence 
of interest (i.e., the sequence against which the PRO polypeptide of interest is being compared which may be 
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a PRO variant polypeptide) as determined by WU-BLA5T-2 by (b) the total number of amino acid residues of 
the PRO polypeptide of interest. 

"PRO variant polynucleotide" or **PRO variant nucleic acid sequence" means a nucleic acid molecule 
which encodes an active PRO polypeptide as defined below and which has at least about 80% nucleic acid 
sequence identity with a nucleotide acid sequence encoding a full-length native sequence PRO polypeptide 
5 sequence as disclosed herein, a full-length native sequence PRO polypeptide sequence lacking the signal peptide 
as disclosed herein, an extracellular domain of a PRO polypeptide as disclosed herein or any other fragment of 
a full-length PRO polypeptide sequence as disclosed herein. Ordinarily, a PRO variant polynucleotide will have 
at least about 80% nucleic acid sequence identity, more preferably at least about 81% nucleic acid sequence 
identity, more preferably at least about 82% nucleic acid sequence identity, more preferably at least about 83% 

10 nucleic acid sequence identity, more preferably at least about 84% nucleic acid sequence identity, more 
preferably at least about 85% nucleic acid sequence identity, more preferably at least about 86% nucleic acid 
sequence identity,. more preferably at least about 87% nucleic acid sequence identity, more preferably at least 
about 88% nucleic acid sequence identity, more preferably at least about 89% nucleic acid sequence idemity, 
more preferably at least about 90% nucleic acid sequence identity, more preferably at least about 91 % nucleic 

IS acid sequence identity, more preferably at least about 92% nucleic acid sequence identity, more preferably at 
least about 93% nucleic acid sequence identity, more preferably at least about 94% nucleic acid sequence 
identity, more preferably at least about 95% nucleic acid sequence identity, more preferably at least about 96% 
nucleic acid sequence identity, more preferably at least about 97% nucleic acid sequence identity, more 
preferably at least about 98% nucleic acid sequence identity and yei more preferably at least about 99% nucleic 

20 acid sequence identity with the nucleic acid sequence encoding a full-length native sequence PRO polypeptide 
sequence as disclosed herein, a full-length native sequence PRO polypeptide sequence lacking the signal peptide 
as disclosed herein, an extracellular domain of a PRO polypeptide as disclosed herein or any other fragmeru of 
a full-length PRO polypeptide sequence as disclosed herein. Variants do not encompass the native nucleotide 
sequence. 

25 Ordinarily, PRO variant polynucleotides are at least about 30 nucleotides in length, often at least about 

60 nucleotides in length, more often at least about 90 nucleotides in length, more often at least about 120 
nucleotides in length, more often at least about 150 nucleotides in length, more often at least about 180 
nucleotides in length, more often at least about 210 nucleotides in length, more often at least about 240 
nucleotides in length, more often at least about 270 nucleotides in length, more often at least about 300 

30 nucleotides in length, more often at least about 450 nucleotides in length, more often at least about 600 
nucleotides in length, more often at least about 900 nucleotides in length, or more. 

"Percent (%) nucleic acid sequence identity" with respect to PRO-encoding nucleic acid sequences 
identified herein is defmed as the percentage of nucleotides in a candidate sequence that are identical with the 
nucleotides in the PRO nucleic acid sequence of interest, after aligning die sequences and introducing gaps, if 

35 necessary, to achieve the maximimi percent sequence identity. Alignment for purposes of determining percent 
nucleic acid sequence identity can be achieved in various ways that are within the skill in the art, for instance, 
using publicly available computer software such as BLAST, BLAST-2, ALIGN or Megalign (DNASTAR) 
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software. For purposes herein, however, % nucleic acid sequence identity values are generated using the WU- 
BLAST-2 computer program (Altschul et al., Methods in Enzvmology 266:46(M80 (1996)). Most of the WU- 
BLAST'2 search parameters are set to the default values. Those not set to default values, i.e., the adjustable 
parameters, are set with the following values: overlap span = 1 , overlap fraction = 0. 125, word threshold (T) 
= 11, and scoring matrix = BLOSUM62. For purposes herein, a % nucleic acid sequence identity value is 
5 determined by dividing (a) the number of matching identical nucleotides between the nucleic acid sequence of 
the PRO polypeptide-encoding nucleic acid molecule of interest having a sequence derived from the native 
sequence PRO polypeptide-encoding nulceic acid and the comparison nucleic acid moelcule of interest (i.e., the 
sequence against which the PRO polypeptide-encoding nucleic acid molecule of interest is being compared which 
may be a variant PRO polynucleotide) as determined by WU-BLAST-2 by (b) the total number of nucleotides 

10 of the PRO polypeptide-encoding nucleic acid molecule of interest. 

in other embodiments, PRO variant polynucleotides are nucleic acid molecules that encode an active 
PRO polypeptide and which are capable of hybridizing, preferably under stringent hybridization and wash 
conditions, to nucleotide sequences encoding a full-length PRO polypeptide as disclosed herein. PRO variant 
polypeptides may be those that are encoded by a PRO variant polynucleotide. 

IS The term "positives", in the context of sequence comparison performed as described above, includes 

residues in the sequences compared that are not identical but have similar properties (e.g. as a result of 
conservative substitutions, see Table 1 below). For purposes herein, the % value of positives is determined by 
dividing (a) the number of amino acid residues scoring a positive value beiwcen the PRO polypeptide amino acid 
sequence of interest having a sequence derived from the native PRO polypeptide sequence and the comparison 

20 amino acid sequence of interest (i.e., the amino acid sequence against which the PRO polypeptide sequence is 
being compared) as determined in the BLOSUM62 matrix of WU-BLAST-2 by (b) the total number of amino 
acid residues of the PRO polypeptide of interest. 

"Isolated, " when used to describe the various polypeptides disclosed herein, means polypeptide that has 
been identified and separated and/or recovered from a component of its natural environment. Contaminant 

25 components of its natural environment are materials that would typically interfere with diagnostic or therapeutic 
uses for the polypeptide, and may include enzymes, hormones, and other proteinaceous or non-proteinaceous 
solutes. In preferred embodiments, the polypeptide will be purified (1) to a degree sufficient to obtain at least 
15 residues of N-terminal or internal amino acid sequence by use of a spinning cup sequenator, or (2) to 
homogeneity by SDS-PAGE under non-reducing or reducing conditions using Coomassie blue or, preferably, 

30 silver stain. Isolated polypeptide includes polypeptide in situ within recombinant cells, since at least one 
component of the PRO polypeptide natural environment will not be present. Ordinarily, however, isolated 
polypeptide will be prepared by at least one purification step. 

An "isolated" PRO polypeptide-encoding nucleic acid is a nucleic acid molecule that is identified and 
separated from at least one contaminant nucleic acid molecule with which it is ordinarily associated in the natural 

35 source of the PRO polypeptide nucleic acid. An isolated PRO polypeptide nucleic acid molecule is other than 
in the form or setting in which it is found in nature. Isolated PRO polypeptide nucleic acid molecules therefore 
are distinguished from the specific PRO polypeptide nucleic acid molecule as it exists in natural cells. However, 
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an isolated PRO poiypqjtide nucleic acid molecule includes PRO polypeptide nucleic acid molecules contained 
in ceils thai ordinarily express the PRO polypeptide where, for example, the nucleic acid molecule is in a 
chromosomal location different from that of natural cells. 

The term "control sequences" refers to DNA sequences necessary for the expression of an operably 
linked coding sequence in a panicular host organism. The control sequences that are suitable for prokaryoies, 
5 for example, include a promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells 
are known to utilize promoters, polyadenyiatton signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic 
acid sequence. For example, DNA for a presequence or secretory leader is operably linked to DNA for a 
polypeptide if it is expressed as a preprotein that participates in the secretion of the polypeptide; a promoter or 

10 enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome 
binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation. Generally, 
"operably linked" means that the DNA sequences being linked are contiguous, and, in the case of a secretory 
leader, contiguous and in reading phase. However, enhancers do not have to be contiguous. Linking is 
accomplished by ligation at convenient restriction sites. If such sites do not exist, the synthetic oligonucleotide 

15 adaptors or linkers are used in accordance with conventional practice. 

The term "antibody" is used in the broadest sense and specifically covers, for example, single anti-PRO 
monoclonal antibodies (including agonist, antagonist, and neutralizing antibodies), anti-PRO antibody 
compositions with polyepitopic specificity, single chain anii-PRO antibodies, and fragments of anti-PRO 
antibodies (see below). The term "monoclonal antibody" as used herein refers to an antibody obtained from a 

20 population of substantially homogeneous antibodies, i.e., the individual antibodies comprising the population are 
identical except for possible namrally-occurring mutations that may be present in minor amotmts. 

"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in the art, and 
generally is an empirical calculation dependent upon probe length, washing temperamre, and salt concentration. 
In general, longer probes require higher temperatures for proper annealing, while shoner probes need lower 

25 temperatures. Hybridization generally depenids on the ability of denatured DNA to reanneal when 
complementary strands are present in an environment below dieir melting temperature. The higher the degree 
of desired homology between the probe and hybridizable sequence, the higher the relative temperature which 
can be used. As a result, it follows that higher relative temperanires would tend to make the reaction conditions 
more stringent, while lower ten^ranires less so. For additional details and explanation of stringency of 

30 hybridization reactions, see Ausubel et al.. Current Protocols in Molecular Biolo2v. Wiley Interscience 
Publishers, (1995). 

"Su-ingent conditions" or "high stringency conditions", as defined herein, may be identified by those 
that: (1) employ low ionic strength and high temperature for washing, for example 0.015 M sodium 
chloride/0.0015 M sodium citrate/0.1% sodium dodecyl sulfate at 50**C; (2) employ during hybridization a 
35 denaturing agent, such as formamide, for example, 50% (v/v) formamide with 0.1% bovine sertun 
albumin/0. 1 % Ficoll/0. 1 % polyvinylpyrrolidone/50mMsodium phosphate buffer at pH 6.5 with 750 mM sodium 
chloride, 75 mM sodium ciu^te at 42*^0; or (3) employ 50% formamide, 5 x SSC (0.75 M NaCl, 0.075 M 
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sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1 % sodium pyrophosphate, 5 x Denhardrs solution, 
sonicated salmon sperm DNA (50 ^g/ml), 01 % SDS, and 10% dexiran sulfate at 42**C. with washes at 42**C 
in 0.2 X SSC (sodium chloride/sodium ciu-ate) and 50% formamide at SS^'C. followed by a high-stringency wash 
consisting of 0. 1 x SSC conuining EDTA ai SS^'C. 

"Moderately suingent conditions" may be identified as described by Sambrook ei al.. Molecular 

5 Cloning: A Laboratory Manual. New York: Cold Spring Harbor Press, 1989, and include the use of washing 
solution and hybridization conditions (e.g., temperature, ionic strength and %SDS) less stringent that those 
described above. An example of moderately su^ingent conditions is overnight incubation at 37**C in a solution 
comprising: 20% formamide, 5 x SSC (150 mM NaCl, 15 mM trisodium citrate), 50 mM sodium phosphate (pH 
7.6), 5 X Denhardt's solution, 10% dextran sulfate, and 20 mg/ml denanired sheared salmon sperm DNA, 

10 followed by washing the filters in 1 x SSC at about 37-50**C. The skilled artisan will recognize how to adjust 
the temperature, ionic strength, etc. as necessary to accommodate factors such as probe length and the like. 

The term "epitope tagged" when used herein refers to a chimeric polypeptide comprising a PRO 
polypeptide fused to a "tag polypeptide". The tag polypeptide has enough residues to provide an epitope against 
which an antibody can be made, yet is short enough such that it does not interfere with activity of the polypeptide 

15 to which it is fused. The tag polypeptide preferably also is fairly unique so that the antibody does not 
substantially cross-react with other epitopes. Suitable tag polypeptides generally have at least six amino acid 
residues and usually between about 8 and 50 amino acid residues (preferably, between about 10 and 20 amino 
acid residues). 

As used herein, the term "immunoadhesin" designates antibody-like molecules which combine the 
20 binding specificity of a heterologous protein (an "adhesin") with the effector functions of immunoglobulin 
constant domains. Sirucmrally, the immunoadhesins comprise a fusion of an amino acid sequence with the 
desired binding specificity which is other than the antigen recognition and binding site of an antibody (i.e., is 
"heterologous"), and an immunoglobulin constant domain sequence. The adhesin part of an immunoadhesin 
molecule typically is a contiguous amino acid sequence comprising at least the binding site of a receptor or a 
25 ligand. The immimoglobulin constant domain sequence in die immunoadhesin may be obtained from any 
immunoglobulin, such as IgG-1, IgG-2, IgG-3, or lgG-4 subtypes, IgA (including IgA-1 and IgA-2), IgE, IgD 
or IgM. 

"Active" or "activity" for the purposes herein refers to form(s) of a PRO polypeptide which retain a 
biological and/or an immunological activity of native or naturally -occurring PRO, wherein "biologicaP activity 

30 refers to a biological function (cither inhibitory or stimulatory) caused by a native or naturally-occurring PRO 
other than the ability to induce the production of an antibody against an antigenic epitope possessed by a native 
or namrally-occurring PRO and an "immunological" activity refers to the ability to induce the production of an 
antibody against an antigenic epitope possessed by a native or naturally-occurring PRO. 

The term "antagonist" is used in the broadest sense, and includes any molecule that partially or fully 

35 blocks, inhibits, or neutralizes a biological activity of a native PRO polypeptide disclosed herein. In a similar 
manner, the term "agonist" is used in the broadest sense and includes any molecule that mimics a biological 
activity of a native PRO polypeptide disclosed herein. Suitable agonist or antagonist molecules specifically 
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include agonisi or aiuagonisi antibodies or antibody fragments, fragments or amino acid sequence variants of 
native PRO polypeptides, peptides, small organic molecules, etc. Methods for identifying agonists or 
antagonists of a PRO polypeptide may comprise contacting a PRO polypeptide with a candidate agonist or 
antagonist molecule and measuring a detectable change in one or more biological activities normally associated 
with the PRO polypeptide. 

5 "Treatment* refers to both therapeutic treatment and prophylactic or preventative measures, wherein 

the object is to prevent or slow down (lessen) the targeted pathologic condition or disorder. Those in need of 
treatment include those already with the disorder as well as those prone to have the disorder or those in whom 
the disorder is to be prevented. 

*Chronic" administration refers to administration of the agent(s) in a continuous mode as opposed to 
10 an acute mode, so as to maintain the initial therapeutic effect (activity) for an extended period of time. 
"Ittterminent" administration is treatment that is not consecutively done without interruption, but rather is cyclic 
in nature. 

"Mammal" for purposes of treatment refers to any animal classified as a manunai, including humans, 
domestic and farm animals, and zoo, sports, or pet animals, such as dogs, cats, caule, horses, sheep, pigs, goats, 

15 rabbits, etc. Preferably, the manmial is human. 

Administration "in combination with' one or more fiinher therapeutic agents includes simultaneous 
(concurrent) and consecutive administration in any order. 

•Carriers" as used herein include pharmaceutically acceptable carriers, excipients, or stabilizers which 
are nontoxic to the cell or mammal being exposed thereto at the dosages and concentrations employed. Often 

20 the physiologically acceptable carrier is an aqueous pH buffered solution. Examples of physiologically 
acceptable carriers include buffers such as phosphate, citrate, and other organic acids; antioxidants including 
ascorbic acid; low molecular weight (less than about 10 residues) polypeptide; proteins, such as serum albumin, 
gelatin, or immunoglobulins; hydrophilic polymers such as polyvinylpyrrolidone; amino acids such as glycine, 
glutamine, asparagine, arginine or lysine; monosaccharides, disaccharides, and other carbohydrates including 

25 glucose, mannose, or dextrins; chelating agents such as EDTA; sugar alcohols such as mannitol or sorbitol; salt- 
forming counterions such as sodium; and/or nonionic surfactants such as TWEEN™, polyethylene glycol (PEG), 
and PLURONICS* 

"Antibody fragments" comprise a portion of an intact antibody, preferably the antigen binding or 
variable region of the intact antibody. Examples of antibody fragments include Fab, Fab\ F(ab')3, and Fv 
30 fragments; diabodies; linear antibodies (Zapata et al.. Protein Eng. 8(10): 1057-1062 [1995]); single-chain 
antibody molecules; and multispecific antibodies formed from antibody fragments. 

Papain digestion of antibodies produces two identical antigen-bmding fragments, called "Fab" 
fragments, each with a single antigen-binding site, and a residual "Fc" fiagment, a designation reflecting the 
ability to crystallize readily. Pepsin treatment yields an F(ab')3 fragment that has two antigen-combining sites 
35 and is still capable of cross-linking antigen. 
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"Fv" is the minimum antibody fragment which contains a complete aniigen-recognition and -binding 
site. This region consists of a dimer of one heavy- and one light-chain variable domain in tight, non-covalent 
association. It is in this configuration that the three CDRs of each variable domain interact to defme an antigen- 
binding site on the surface of the VH-VL dimer. Collectively, the six CDRs confer antigen-binding specificity 
to the antibody. However, even a single variable domain (or half of an Fv comprising only three CDRs specific 
5 for an andgen) has the ability to recognize and bind antigen, although at a lower affinity than the entire binding 
site. 

The Fab fragment also contains the constant domain of the light chain and the first constant domain 
(CHI) of the heavy chain. Fab fragments differ from Fab* fragments by the addition of a few residues at the 
carboxy terminus of the heavy chain CHI domain including one or more cysteines from the antibody hinge 
10 region. Fab*-SH is the designation herein for Fab* in which the cysteine residue(s) of the constant domains bear 
a free thiol group. F(ab*)2 antibody fragments originally were produced as pairs of Fab* fragments which have 
hinge cysteines between them. Other chemical couplings of antibody fragments are also known. 

The *'light chains" of antibodies (inuntmoglobulins) from any vertebrate species can be assigned to one 
of tv/o clearly distinct types, called kappa and lambda, based on the amino acid sequences of their constant 
IS domains. 

Depending on the amino acid sequence of the constant domain of their heavy chains, immunoglobulins 
can be assigned to different classes. There are five major classes of immunoglobulins: IgA, IgD. IgE, IgG, and 
IgM, and several of these may be further divided into subclasses (isotypes), e.g. , IgG 1 , IgG2, IgG3, lgG4, IgA, 
and IgA2. 

20 "Single-chain Fv" or "sFv" antibody fragments comprise the VH and VL domains of antibody, wherein 

these domains are present in a single polypeptide chain. Preferably, the Fv polypeptide further comprises a 
polypeptide linker between the VH and VL domains which enables the sFv to form the desired structure for 
antigen binding. For a review of sFv, see Fiuckthim in The Pharmacology of Monoclonal Antibodies, vol. 1 13, 
Rosenburg and Moore eds.. Springer- Verlag, New York, pp. 269-315 (1994). 

25 The term "diabodies" refers to small antibody fragments with two antigen-binding sites, which 

fragments comprise a heavy-chain variable domain (VH) connected to a light-chain variable domain (VL) in the 
same polypeptide chain (VH - VL). By using a linker that is too short to allow pairing between die two domains 
on the same chain, the domains are forced to pair with the complementary domains of another chain and create 
two antigen-binding sites. Diabodies are described more fully in, for example, EP 404,097; WO 93/1 1161 ; and 

30 Hollinger et al. . Proc. Natl. Acad. Sci. USA. 90:6444-6448 (1993). 

An ''isolated" antibody is one which has been identified and separated and/or recovered from a 
component of its natural environment. Contaminant components of its natural environment are materials which 
would interfere with diagnostic or therapeutic uses for the antibody, and may include enzymes, hormones, and 
other proieinaceous or nonproteinaceous solutes. In preferred embodiments, the antibody will be purified (1) 

35 to greater dian 95% by weight of antibody as determined by the Lowry method, and most preferably more than 
99% by weight, (2) to a degree sufficient to obtain at least 15 residues of N-terminal or internal amino acid 
sequence by use of a spinning cup sequenator, or (3) to homogeneity by SDS-PAGE under reducing or 
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nonreducing conditions using Coomassie blue or, preferably, silver stain. Isolated antibody includes the antibody 
in situ within recombinant cells since at least one component of the antibody's natural environment will not be 
present. Ordinarily, however, isolated antibody will be prepared by at least one purification step. 

Hie word "label* when used herem refers to a detectable compound or composition which is conjugated 
directly or indirectly to the antibody so as to generate a 'labeled" antibody. The label may be detectable by itself 
5 (e.g. radioisotope labels or fluorescent labels) or, in the case of an enzymatic label, may catalyze chemical 
alteration of a substrate compound or composition which is detectable. 

By "solid phase" is meant a non-aqueous matrix to which the antibody of the present invention can 
adhere. Examples of solid phases encompassed herein mclude those formed panially or entirely of glass (e.g. , 
controlled pore glass), polysaccharides (e.g., agarose), polyacrylamides, polystyrene, polyvinyl alcohol and 
10 silicones. In certain embodiments, depending on the context, the solid phase can comprise the well of an assay 
plate; in others it is a purification colunm (e.g., an affinity chromatography column). This term also includes 
a discontinuous solid phase of discrete particles, such as those described in U.S. Patent No. 4,275,149. 

A "liposome" is a small vesicle composed of various types of lipids, phospholipids and/or surfactant 
which is useful for delivery of a drug (such as a PRO polypeptide or antibody thereto) to a mammal. The 
15 components of the liposome are conunonly arranged in a bilayer formation, similar to the lipid arrangement of 
biological membranes. 

A ''small molecule** is defined herein to have a molecular weight below about 500 Daltons. 

II. ComiK>sitions and Methods of the Invention 

20 The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PRO polypeptides. In panicular, cDNAs encoding various PRO 
polypeptides have been identified and isolated, as disclosed in further detail in the Examples below. It is noted 
that proteins produced in separate expression rounds may be given different PRO numbers but the UNQ number 
is unique for any given DNA and the encoded protein, and will not be changed. However, for sake of 

25 simplicity, in the present specification the protein encoded by the full length native nucleic acid molecules 
disclosed herein as well as all further native bomologues and variants included in the foregoing definition of 
PRO, will be referred to as ''PRO/number", regardless of their origin or mode of preparation. 

As disclosed in the Examples below, various cDNA clones have been deposited with the ATCC. The 
actual nucleotide sequences of those clones can readily be determined by the skilled artisan by sequencing of the 

30 deposited clone using routine methods in the art. The predicted amino acid sequence can be determined from 
the nucleotide sequence using routine skill. For the PRO polypeptides and encoding nucleic acids described 
herein, Applicants have identified what is believed to be the reading frame best identifiable with the sequence 
information available at the time. 

35 1. FulNength PR0281 Polypeptides 

The present invention provides newly identifiedand isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0281 (UNQ244). In panicular, cDNA encoding a PR0281 
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polypeptide has been identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST-2 sequence alignment computer program, it has been found that a fiill-lengih 
native sequence PR0281 (shown in Figure 2 and SEQ ID NO:2) has cenain amino acid sequence identity with 
the rat TEGT protein. Accordingly, it is presently believed that PR0281 disclose in the present application is 
a newly identified TEGT homolog and may possess activity typical of that protein. 

5 

2. FulMeneth PRQ276 PolvDeptides 
The present invention provides newly identified and isolated nucleotide sequencesencoding polypeptides 
referred to in the present application as PR0276 (UNQ243). In panicular, cDNA encoding a PR0276 
polypeptide has been identified and isolated, as disclosed in further detail in the Examples below. 
10 As far as is known, the DNA16435-1208 sequence encodes a novel factor designated herein as PR0276; 

using WU-BLAST-2 sequence alignment computer programs, no significant sequence identities to any known 
proteins were revealed. The sequence identity identifications which were found are listed below in the examples. 



3. Full-lenpth PR0189 Polypeptides 

IS The present invention provides newly identifiedand isolatednucleotide sequences encoding polypeptides 

referred to in the present application as PRO 189. In particular. Applicants have identified and isolated cDNA 
encoding a PRO 189 polypeptide, as disclosed in funher detail in the Examples below. To Applicants present 
knowledge, the DNA21624-1391 nucleotide sequence encodes a novel factor; using BLAST and FastA sequence 
alignment computer programs, no significant sequence identities to any known proteins were revealed. 

20 

4. Full-length PRO190 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 190. In particular. Applicants have identified and isolated cDNA 
encoding a PRO190 polypeptide, as disclosed in further detail in the Examples below. The PR0190-encoding 
25 clone was isolated from a hiunan retina library. To Applicants present knowledge, the DNA23334-1392 
nucleotide sequence encodes a novel multiple transmembrane spanning protein; using BLAST and FastA 
sequence alignment computer programs, there is some sequence identity with CMP-sialicacid and UDP-galactose 
transporters, indicating that PRO190 may be related to transponer or that PRO190 may be a novel transporter. 

30 

5. FulMength PR0341 Polypeptides 

The present invention provides newly identified and isolatednucleotide sequences encoding polypeptides 
referred to in the present application as PR0341 (UNQ300). in particular. cDNA encoding a PR0341 
polypeptide has been identified and isolated, as disclosed in further detail in the Examples below. 
35 The DNA26288-1239 clone was isolated from a human placenta library. As far as is known, the 

DNA26288-1239 sequence encodes a novel factor designated herein as PR0341; using the WU-BLAST-2 
sequence alignment computer program, no significant sequence identities to any known proteins were revealed. 
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6. FulMength PRO180 Poivpeptides 

The present inveniion provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO180 (UNQ154). In particular, cDNA encoding a PRO180 
polypeptide has been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA26843-1389 clone was isolated from a hiunan placenta library using oligos formed from 
5 DNA12922 isolated from an amylase screen. As far as is known, the DNA26843-1389 sequence encodes a novel 
factor designated herein as PROiSO. 

7. FulMength PR0194 Polypeptides 

The present invention provides newly identified and isolated nucleotidesequencesencoding polypeptides 
10 referred to in the present application as PR0194. In particular. Applicants have identified and isolated cDNA 
encoding a PR0194 polypeptide, as disclosed in further detail in the Examples below. The PR0194-encoding 
clone was isolated from a human fetal limg library. To Applicants present knowledge, the DNA26844-I394 
nucleotide sequence encodes a novel factor; using BLAST and FasiA sequence alignment computer programs, 
no significant sequence identities to any known proteins were revealed. 

15 

8- FulMength PRO203 Polypeptides 

The present invention provides newly identifiedand isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO203. In particular. Applicants have identified and isolated cDNA 
encoding a PRO203 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FasiA 
20 sequence alignment computer programs. Applicants found that the PRO203 polypeptide has sequence identity 
with GST ATPase. Accordingly, it is presently believed that PRO203 polypeptide disclosed in the present 
application is a newly identified member of the ATPase family and possesses activity typical of the GST ATPase, 

9. FulMength PRO290 Polypeptides 
25 The present inventicmprovidesnewly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PRO290. In particular, cDN A encoding a PRO290 polypeptide has been 
identified and isolated, as disclosed in further detail in the Examples below. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 23 (SEQ ID NO:33), revealed sequence identities 
30 between the PRO290 amino acid sequence and the following Dayhoff sequences:P_R99800, CC4H_HUMAN. 
YCS2_,YEAST. CEF35G12.13, HSFAN.l, MMU52461.1, MMU70015.1, HSU67615.1, CETOlHlOJand 
CELT28F2^6. 

It is currently believed that PRO290 is an intracellular protein related to one or more of the above 

proteins. 

35 
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10. FulMen^h PR0874 Poivpcptides 

The present invention provides newly idemifsed and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0874. In panicular. Applicants have identified and isolated cDNA 
encoding a PR0874 polypeptide, as disclosed in further detail in the Examples below. The PR0874-encoding 
clone was isolated from a human fetal lung library. To Applicants present knowledge, the DNA4062 1-1440 
S nucleotide sequence encodes a novel factor. Although, using BLAST and FastA sequence alignment computer 
programs, some sequence identity with known proteins was revealed. 

11. Full-length PRO710 Polypeptides 

The present invention provides newly identifled and isolated nucteotidesequences encoding polypeptides 
10 referred to in the present application as PRO710. In particular, Applicants have identified and isolated cDNA 
encoding a PRO710 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs. Applicants found that the PR07 10 polypeptide has significant similarity 
to the CDC4S protein. Accordingly, it is presently believed that PRO710 polypeptide disclosed in the present 
application is a newly identified CDC45 homoiog. 

15 

12. FulHength PR01151 Polypeptides 

The present invention provides newly identifiedand isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 151. In particular. cDNA encoding a PROl 151 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
20 Using the WU-BLAST-2 sequence alignment computer program, it has been found that a fiill-lengih 

native sequence PROl 151 (shown in Figure 30 and SEQ ID NO:47) has certain amino acid sequence identity 
with the human 30 kD adipocyte complement-related precursor protein (ACR3_HUMAN). Accordingly, it is 
presently believed that PR01151 disclosed in the present application is a newly identified member of the 
complement protein family and may possess activity typical of that family. 

25 

13. FulMcngth PR01282 Polypeptides 

The present inventionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 1282. In particular, cDNA encoding a PRO 1282 polypeptide has 
been identified and isolated, as disclosed in further deuil in the Examples below. 
30 As far as is known, the DNA45495-1550 sequence encodes a novel factor designated herein as 

PROI282. Using WU-BLAST-2 sequence alignment computer programs, some sequence identities between 
PR01282 and other leucine rich repeat proteins were revealed, as discussed in the examples below, indicating 
that a novel member of the leucine rich repeat superfamily has been identified. 

35 14. Full-length PR0358 Polypeptides 

The present invention further provides newly identified and isolated nucleotide sequences encoding a 
polypeptide referred to in the present application as PR0358. In panicular. Applicants have identified and 
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isolated cDNA encoding a novel human Toll polypeptide (PR0358), as disclosed in further detail in the 
Examples below. Using BLAST and FasiA sequence alignment computer programs. Applicants found that the 
coding sequence of PR0358 shows significant homology to DNA sequences HSU88540 1, HSU88878_1, 
HSU88879 ^1. HSU88880_^1, HS88881_1, and HSU79260_1 in the GenBank database. With the exception 
of HSU79260_l, die noted proteins have been identified as human toll-like receptors. 

Accordingly, it is presently believed that the PR0358 proteins disclosed in the present application are 
newly identified himian homologues of the Drosophila protein Toll, and are likely to play an important role in 
adaptive immunity. More specifically. PR0358 may be involved in inflammation, septic shock, and response 
to pathogens, and play possible roles in diverse medical conditions that are aggravated by immune response, such 
as, for example, diabetes, ALS, cancer, rheumatoid anhritis, and ulcers. The role of PR0385 as pathogen 
pattern recognition receptors, sensing die presence of conserved molecular structures present on microbes, is 
further supponed by the data disclosed in the present application, showing diat a known human Toll-like 
receptor, TLR2 is a direct mediator of LPS signaling. 

15. Full-length PROI310 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1310. In panicular, cDNA encoding a PRO1310 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST-2 sequence alignment computer programs, it has been foiuid that a full-length native 
sequence PRO 13 10 (shown in Figure 36 and SEQ ID NO: 62) has cenain amino acid sequence identity with 
carboxypepiidase X2. Accordingly, it is presently believed thai PRO1310 disclosed in the present application 
is a newly identified member of die carboxypeptidase family and may possess carboxyl end amino acid removal 
activity. 

16. Full-length PR0698 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present s^licaxion as PR0698. In particular. Applicants have identified and isolated cDNA 
encoding a PR0698 polypeptide, as disclosed in further detail in die Examples below. Using BLAST and FastA 
sequence alignmem computer programs. Applicants found Uiat die PR0698 polypeptide has significant similarity 
to die olfactomedin protein. Accordingly, it is presently believed diat PR0698 polypeptide disclosed in the 
present application may be a newly identified olfactomedin homolog.. 

17. Full-length PR0732 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences .encoding polypeptides 
referred to in die present application as PR0732. In panicular. Applicants have identified and isolated cDNA 
encoding a PR0732 polypeptide, as disclosed in further detail in die Examples below. Using BLAST and FastA 
sequence alignment computer programs, Applicants found that die PR0732 polypeptide has significant similarity 
to the human placental Diff33 protein. Accordingly, it is presendy believed diat PR0732 polypeptide disclosed 
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in the present application is a newly identified Diff33 bomolog. 

18. FulMength PRO1120 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 120. in particular, cDNA enccxling a PROl 120 polypeptide has 
5 been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST-2 sequence alignment computer programs, it has been foimd that a full-length native 
sequence PROl 120 (shown in Figure 47 and SEQ ID NO:84) has certain amino acid sequence identity with the 
known sulfatase proteins designated CELK09C4_] , and GL6S_HUM AN, respectively, in the Dayhoff database 
(version 35.45 SwissProt 35). Accordingly » it is presently believed chat PROl 120 disclosed in the present 
10 application is a newly identified member of the sulfatase family and may possess activity typical of sulfatases. 

19. FulMength PROS37 PolYpeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0537. In particular, cDNA encoding a PR0537 polypeptide has been 

15 identified and isolated, as disclosed in further detail in the Examples below. The DNA49 14 1-1431 clone was 
isolated from a human placenta library using a trapping technique which selects for nucleotide sequences 
encoding secreted proteins. Thus, the DNA4914 1-1431 clone does encode a secreted factor. As far as is 
known, the DNA49141-1431 sequence encodes a novel factor designated herein as PR0537; using the WU- 
BLAST2 sequence alignment computer program, no significant sequence identities to any known proteins were 

20 revealed. 

20. Full-length PR0536 Polypeptides 

The present invention providesnewly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0536. In particular, cDNA encoding a PR0536 polypeptide has been 
25 identified and isolated, as disclosed in further detail in the Examples below. 

The DNA49142-1430 clone was isolated from a human infant brain library using a trapping technique 
which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA49142-1430 clone does 
encode a secreted factor. As far as is known, the DNA49142-I430 sequence encodes a novel factor designated 
herein as PR0536; using the WU-BLAST-2 sequence alignment computer program, no significant sequence 
30 identities to any known proteins were revealed. 

21. Full-length PR0535 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polyi)eptides 
referred to in the present application as PR0535. In particular, cDNA encoding a PR0535 polypeptide has been 
35 identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a fiilMength 
native sequence PR0535 (shown in Figure 53 and SEQ ID NO:99) has amino acid sequence identity with a 
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putative peptidyl-prolyl isomerasc protein. Accordingly, it is presently believed that PR0535 disclosed in the 
present application is a newly identified member of the isomerase protein family and may possess activity typical 
of those proteins. 

22. FulHeneth PR0718 Polvi>eptides 
5 The present invention provides newly identified and isolated nucleotide sequencesencodingpolypeptides 

referred to in the present application as PR0718. In particular. Applicants have identified and isolated cDNA 
encoding a PR0718 polypeptide, as disclosed in further detail in the Examples below. The PR0718-cncoding 
clone was isolated from a human fetal lung library. To Applicants present knowledge, the DNA49647-1398 
nucleotide sequence encodes a novel factor; using BLAST and FastA sequence alignment computer programs, 
10 no significant sequence identities to any known proteins were revealed. 

23- FulHength PR0872 Polvpeptidcs 
The present invention provides newly identified and isolatednucleotidesequences encoding polypeptides 
referred to in the present plication as PR0872. In particular. Applicants have identified and isolated cDNA 
15 encoding a PR0872 polypeptide, as disclosed in further deuil in the Examples below. Using BLAST and FastA 
sequence alignment computer programs. Applicants fomid that the PR0872 polypeptide has sequence identity 
with dehydrogenases. Accordmgly, it is presently believed that PR0872 polypeptide disclosed in the present 
application is a newly identified member of the dehydrogenase family and possesses dehydrogenase activity. 

20 24. Full-length PRQ1063 Polypeptides 

The present invention provides newly identifiedand isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 1063. In panicuiar. Applicants have identified and isolated cDNA 
encoding a PRO1063 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs. Applicants found that the PRO 1063 polypeptide has significant 
25 similarity to the human type IV coUagenase protein. Accordingly, it is presently believed that PRO 1063 
polypeptide disclosed in the present application is a newly identified coUagenase homolog. 

25. Full-length PR0619 Poivoeptides 
The present inventionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
30 referred to in the present application as PR0619. hi particular, cDNA encoding a PR06i9 polypeptide has been 
identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST-2 sequence alignment computer program, it has been found that a full-length native 
sequence PR0619 (shown in Figure 68 and SEQ ID NO: 117) has certain amino acid sequence identity with 
VprcB3. Accordingly, u is presently believed that PR0619 disclosed in the present application is a newly 
35 identified member of the IgG superfamily and may possess activity related to the assembly and/or components 
of the surrogate light chain associated with developing B ceUs. 
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26. FulMength PR0943 PoivDeptides 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0943. In particular, cDNA encoding a PR0943 polypeptide has been 
identified and isolated, as disclosed in further detail in the E^iamples below. 

Using the WU-BLAST-2 sequence alignment computer program, it has been found that a full-length 
5 native sequence PR0943 (shown in Figure 70 and SEQ ID NO: 1 19) has amino acid sequence identity with the 
fibroblast growth factor receptor-4 protein. Accordingly, it is presently believed that PR0943 disclosed in the 
present application is a newly identified member of the fibrobalst growth factor receptor family and may possess 
activity typical of that family. 



10 27. Fuil-length PR01188 Polvpeptidcs 

The present invention provides newly identified and isolated nucleotidesequences encoding polypeptides 
referred to in the present application as PROl 188. In particular, cDNA encoding a PROl 188 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

As discussed in more detail in Example 1 below, using WU-BLAST-2 sequence alignment computer 
IS programs, it has been found that a full-length native sequence PROl 188 (shown in Figure 72; SEQ ID NO: 124) 
has certain amino acid sequence identity with nucleotide pyrophosphohydrolase (SSU831 14_1). Accordingly, 
it is presently believed that PROl 188 disclosed in the present application is a newly identified member of the 
nucleotide pyrophosphohydrolase family and may possess activity typical of that family of proteins. 



20 28. Full-length PROl 133 Polvpeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 133. In panicular, cDNA encoding a PROl 133 polypeptide has 
been identified and isolated, as disclosed in funher detail in the Examples below. 

Using WU-BLAST-2 sequence alignment computer programs, it has been foimd that a full-length native 
25 sequence PROl 133 (shown in Figure 74 and SEQ ID NO: 129) has certain amino acid sequence identity with 
netrin la, Dayhoff accession AF(X)2717_1 . Accordingly, it is presently believed that PROl 133 disclosed in the 
present application shares at least one related mechanism with netrin. 



29. Full-length FR0784 PolvpcDtides 

30 The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PR0784. In panicular, cDNA, designated herein as "DNA53978-1443", 
which encodes a PR0784 polypeptide, has been identified and isolated, as disclosed in further detail in the 
Examples below. 

Using BLAST and FastA sequence alignment computer programs, it has been found that a full-length 
35 native sequence PR0784 (shown in Figure 76 and SEQ ID NO: 135) has cenain amino acid sequence identity 
with sec22 homologs. Accordingly, it is presently believed that PR0784 disclosed in the present application is 
a newly identified member of the sec22 family and may possess vesicle trafficking activities typical of the sec22 
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family. 

30. FulMength PR0783 Poivpeptides 
The present invention provides newly identified and isolated nucleotide sequences encodingpolypepiides 
referred to in the present application as PR0783. In particular. Applicants have identified and isolated cDNA 
5 encoding a PR0783 polypeptide, as disclosed in further detail in the Examples below. The PR0783-cncoding 
clone was isolated from a human fetal kidney library. To Applicants present knowledge, the DNA53996-1442 
nucleotide sequence encodes a novel factor. However, using BLAST and FasiA sequence alignment computer 
programs, some sequence identity to known proteins was found. 

10 31. Full-length PRO820 Polvpeptides 

The present invention provides newly identified and isolated nucleotidesequences encoding polypeptides 
referred to in the present application as PRO820. In particular. Applicants have identified and isolated cDNA 
encoding a PRO820 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs. Applicants found that various portions of the PRO820 polypeptide have 

15 sequence identity widi the low affinity immunoglotnilin gamma Fc receptor, the IgE high affinity Fc receptor 
and the high affinity inununoglobulin epsilon receptor. Accordingly, it is presently believed that PRO820 
polypeptide disclosed in the present application is a newly identified member of the Fc receptor family. 

32. FulMength PRO1080 Polvpeotides 

20 The present invention provides newly identified and isolated nucleotide sequences encodingpolypepiides 

referred to in the present application as PRO1080. In panicuiar. Applicants have identified and isolated cDNA 
encoding a PRO 1080 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs. Dayhoff database (version 35,45 SwissProt 35), Applicants found 
that the PRO1080 polypeptide has sequence identity with a 39.9 kd protein designated as "YRYI CAEEL". a 

25 DnaJ homolog designated •AF027149_5". a DnaJ homolog 2 designated "RNU95727_r, and Dna3/Cpr3 
designated "AF011793_r. Accordingly, these results indicate that the PRO 1080 polypeptide disclosed in the 
present application may be a newly identified member of the DnaJ-like protein family and therefore may be 
involved in protein biogenesis. 

30 33. FuH-lenrth PRO1079 Polypeptides 

The present invemion provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1079. In panicuiar, cDNA encoding a PRO1079 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

As far as is known, the DNA56050-1455. sequence encodes a novel factor designated herein as 
35 PRO 1079. Although, using WU-BLAST2 sequence alignment computer programs, some sequence identities 
to known protems was revealed. 
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34. FulMength PR0 793 Polvpcptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0793. In particular » cDNA encoding a PR0793 polypeptide has been 
identified and isolated, as disclosed in further detail in the Examples below. 

The DNA561 10-1437 clone was isolated from a human skin tumor library. As far as is known, the 
5 DNA561 10-1437 sequence encodes a novel factor designated herein as PR0793; using the WU-BLAST-2 
sequence alignment computer program, no significant sequence identities to any known proteins were revealed. 

35. FulMength PRO1016 Polvpcptides 

The present invention provides newly identified and isolated nucleotidesequences encoding polypeptides 
10 referred to in the present application as PRO 1016. In particular. Applicants have identified and isolated cDNA 
encoding a PRO1016 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs. Applicants found that various ponions of the PRO 1016 
polypeptide have sequence identity with acyltransferases. Accordingly, it is presently believed that PRO1016 
polypeptide disclosed in the present application is a newly identified member of the acyliransferase family and 
15 possesses acyltalation capabilities typical of this family. 



36. FulMength PRO1013 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 1013, In panicular. Applicants have identified cDNA encoding a 

20 PRO1013 polypeptide, as disclosed in further detail in the Examples below. The PRO 10 13-encoding clone came 
from a himian breast tumor tissue library. Thus, the PRO1013-encoding clone may encode a secreted factor 
related to cancer. To Applicants present knowledge, the DNA56410-1414 nucleotide sequence encodes a novel 
factor. Using BLAST and FastA sequence alignment computer programs, some sequence identity with 
K1AA01S7 and P120 was revealed. PRO1013 has at least one region in common with growth factor and 

25 cytokine receptors. 

37. FulMength PR0937 Polypeptides 

The present invention provides newly identified and isolated nucleotidesequences encoding polypeptides 
referred to in the present application as PR0937. In particular. Applicants have identified and isolated cDNA 
30 encoding a PR0937 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs, Applicants found that the PR0937 polypeptide has significant sequence 
identity with members of the glypican family of proteins. Accordingly, ii is presently believed that PR0937 
polypeptide disclosed in the present application is a newly identified member of the glypican family possesses 
propenies typical of the glypican family. 

35 
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38. FulHength PR0842 PolvDepttdes 

The present invention provides newly identified and isolated micleotide sequences encoding polypeptides 
referred to in the present application as PR0842. In particular, cDNA encoding a PR0842 polypeptide has been 
identified and isolated, as disclosed in further detail in the Examples below. 

As far as is known, the DNA56855-1447 sequence encodes a novel secreted factor designated herein 
5 as PR0842. However, using WU-BLAST2 sequence aligiuneni computer programs, some sequence identity to 
any known proteins were revealed. 

39. FulMength PR0839 Polypeptides 

The present mvemion provides newly identified and isolated nucleotidesequencesencoding polypeptides 
10 referred to in die present application as PR0839. In particular, cDN A encoding a PR0839 polypeptide has been 
identified and isolated, as disclosed in further detail in the Examples below. 

As far as is known, the DNA56859-1445 sequence encodes a novel factor designated herein as PR0839. 
However, using WU-BLAST-2 sequence alignment computer programs, some sequence identities to known 
proteins was revealed. 

15 

40. Full-length PRO1180 Polypeptides 

The present invention provides newly identiliedand isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 180. In particular. Applicants have identified and isolated cDNA 
encoding a PROl 180 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
20 Fast A sequence alignment computer programs. Applicants found thai the PROl 180 polypeptide has significant 
similarity to methyl transferase enzymes. Accordingly, it is presently believed that PROl 180 polypeptide 
disclosed in the present application is a newly identified member of the mcthyliransferase family and possesses 
activity typical of diat family. 

25 41. Full-length PROl 134 Polvpeotides 

The present invention provides newly identified and isolated nucleotidesequencesencoding polypeptides 
referred to in the present application as PROl 134. In panicular, cDNA encoding a PROl 134 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA56865-1491 clone was isolated from a human fetal liver spleen library using a trapping 
30 technique which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA56865-1491 clone 
does encode a secreted factor. As far as is known, the DNA56865-I491 sequence encodes a novel factor 
designated herein as PROl 134; using the WU-BLAST2 sequence alignment computer program, no significam 
sequence identities to any known proteins were revealed. 

35 42. Full-length PRO830 Polypeptides 

The present invention provides newly idemifiedand isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO830. In particular, cDN A encoding a PR083D polypeptide has been 
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identified and isolated, as disclosed in further detail in the Examples below. 

The DNA56866-1342 clone was isolated from a human fetal liver/spleen library using a trapping 
technique which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA568^1342 clone 
does encode a secreted factor. As far as is known, the DNA56866-1342 sequence encodes a novel factor 
designated herein as PRO830; using the WU-BLAST'2 sequence alignment computer program, no significant 
5 sequence identities to any known proteins were revealed. 

43, FulMenpth PR01115 Polvpeptides 
The present invemionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 115. In particular, cDNA encoding a PROl 1 15 polypeptide has 
10 been identified and isolated, as disclosed in further detail in the Examples below. 

As far as is known, the DNA56868-1478 sequence encodes a novel transmembrane protein designated 
herein as PROl 1 15. Although, using WU-BLAST-2 sequence alignment computer programs, some sequence 
identities to known proteins were revealed. 

15 44. FuH-len^h PR01277 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR01277. In particular, cDNA encoding a PR01277 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST-2 sequence alignment computer programs » it has been found that a full-length native 
20 sequence PR01277 (shown in Figure 1 13 and SEQ ID NO: 179) has ccnain amino acid sequence identity with 
Coch-5B2 protein (designated **AF012252_P in the Dayhoff database). Accordingly, it is presently believed 
that PRO 1277 disclosed in the present application is a newly identified member of the Coch-5B2 protein family 
and may possess the same activities and propenies as Coch-5B2. 

25 45. FulMength PROl 135 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequencesencoding polypeptides 
referred to in the present application as PROl 135. In panicular. Applicants have identified and isolated cDNA 
encoding a PROl 135 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs. Applicants found that the PROl 135 polypeptide has significant 

30 similarity to the alpha 1 ,2-mannosidase protein. Accordingly « it is presently believed that PROl 135 polypeptide 
disclosed in the present application is a newly identified member of the mannosidase enzyme family and 
possesses activity typical of that family of proteins. 

46. FulMength PROl H 4 Polvpeotides 

35 The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PROl 1 14 interferon receptor. In particular, cDNA encoding a PROl 1 14 
interferon receptor polypeptide has been identified and isolated, as disclosed in further detail in the Examples 
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below. 

Using the WU-BLAST-2 sequence alignmem computer program, it has been found that a full-length 
native sequence PROl 114 interferon receptor polypeptide (shown in Figure 117 and S£Q ID NO: 183) has 
sequence identity with the other known interferon receptors. Accordingly, it is presently believed that PROl 1 14 
interferon receptor possesses activity typical of other interferon receptors. 

5 

47. FulHength PR0828 Polypeptides 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0828. In particular. Applicants have identified and isolated cDNA 
encoding a PR0828 polypeptide, as disclosed in iiirther detail in the Examples below. Using BLAST and FastA 
10 sequence alignment computer programs. Applicants found that the PR0828 polypeptide has sequence identity 
with glutathione peroxidases. Accordingly, it is presently believed that PR0828 polypeptide disclosed in the 
present application is a newly identified member of the glutathione peroxidase family and possesses peroxidase 
activity and other properties typical of glutathione peroxidases. 



15 48. Full-iength PRO1009 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1009. In panicular, cDNA encoding a PRO1009 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST'2 sequence alignment computer programs, it has been found that a iull-iength native 
20 sequence PRO1009 (shown in Figure 122 and SEQ ID NO: 194) has certain amino acid sequence identity with 
long-chain acyl-CoA synthetase homolog designated **F69893". Accordingly, it is presently believed that 
PRO 1009 disclosed in the present application is a newly identified member of the long-chain acyl-CoA synthetase 
family and may possess activity related to this family. 

25 49. Fuli-length PROI007 Polypeptides 

The present invention provides newly identified and isolated nucleotidesequencesencodmg polypeptides 
referred to in the present sqiplication as PR01(X)7. In panicular. Applicants have identified and isolated cDNA 
encoding a PRO1007 polypeptide, as disclosed in fixrther detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs. Applicants found that various portions of the PRO 1007 

30 polypeptide have sequence identity with MAGPIAP. Accordingly, it is presently believed that PRO1007 
polypeptide disclosed in the present application is a newly identified member of the MAGPIAP family and is 
associated with metastasis and/or cell signaling and/or cell replication. 



50. Full-length PRO1056 Polypeptides 
35 The present invention provides newly identified and isolated nucleotide sequences encoding {K>lypq)tides 

referred to in the present application as PRO 1056. In particular, cDNA encoding a PROI056 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
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Using the WU-BLAST-2 sequence alignment computer program, it has been found that a full-length 
native sequence PRO10S6 (shown in Figure 127 and SEQ ID NO: 199) has amino acid sequence identity with 
a chloride channel protein. Accordingly, it is presently believed that PRO1056 disclosed in the present 
application is a newly identified chloride channel protein homolog. 

5 51. FulMeneth PR0826 Polypeptides 

The present invention providesnewly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0826. In particular, cDNA encoding a PR0826 polypeptide has been 
identified and isolated, as disclosed in further detail in the Examples below. 

The DNA57694-1341 clone was isolated from a human fetal heart library using a trapping technique 
10 which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA57694-1341 clone does 
encode a secreted factor. As far as is known, the DNA57694-1341 sequence encodes a novel factor designated 
herein as PR0826; using the WU-BLAST-2 sequence alignment computer program, no significant sequence 
identities to any known proteins were revealed. 

IS 52. Full-length PR0819 PolvDeDtides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0819. In particular, cDNA encoding a PR0819 polypeptide has been 
identified and isolated, as disclosed in further detail in the Examples below. 

The DNA57695-1340 clone was isolated from a human feial liver spleen library using a trapping 
20 technique which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA57695-1340 clone 
does encode a secreted factor. As far as is known, the DNA57695-1340 sequence encodes a novel factor 
designated herein as PROS 19; using the WU-BLAST-2 sequence alignment computer program, no significant 
sequence identities to any known proteins were revealed. 

25 53. Full-length PROI006 Polypeptides 

The present invention provides newly identified and isolated nucleotidesequencesencoding polypeptides 
referred to in the present application as PRO1006. In particular. Applicants have identified and isolated cDNA 
encoding a PRO1006 polypeptide, as disclosed in further detail in the Examples below. The PRO1006-encoding 
clone was isolated from a human uterus library. To Applicants present knowledge, the DNA57699-1412 
30 nucleotide sequence encodes a novel factor; using BLAST and FastA sequence alignment computer programs, 
some sequence identity with a putative tyrosine protein kinase was revealed. 

54. FulNen^h PR01I12 Polypeptides 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
35 referred to in the present application as PROl 1 12. In panicular. Applicants have identified cDN A encoding a 
PROl 112 polypeptide, as disclosed in further detail in Example 1 below. To Applicants present knowledge, the 
DNA57702-1476 nucleotide sequence encodes a novel factor, although using BLAST and FasiA sequence 
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alignment computer programs some sequence identity with other known proteins was found. 

55. FulMength PRO1074 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1074. In panicular. Applicants have identified and isolated cDNA 
S encoding a PRO 1074 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence aligmnent computer programs. Applicants found that ihe PRO1074 polypeptide has sequence 
identity with galactosyltransferase. Accordingly, it is presently believed that PRO1074 polypeptide disclosed 
in the present application is a newly identified member of the galactosyltransferase family and possesses 
galactosyltransferase activity. 

10 

56. Full-length PROIOQS Polypeptides 

The present inventionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in die present ^plication as PROIOOS. In panicular. cDNA encoding a PROIOOS polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
IS As far as is known, the DNA57708-1411 sequence encodes a novel factor designated herein as 

PROIOOS. However, using WU-BLAST2 sequence alignment computer programs, some sequence identities with 
known proteins was revealed. 

57. Full-length PRO1073 Polypeptides 

20 The present invention provides newly identified and isolated nucieoiide sequences encodingpolypeptides 

referred to in the present application as PRO1073. In particular, cDNA encoding a PRO1073 polypeptide has 
been identified and isolated, as disclosed in fiirUier detail in the Examples below. 

As far as is known, the DNA57710 sequence encodes a novel secreted factor designated herein as 
PRO 1073. However, using WU-BLAST2 sequence alignment computer programs, some sequence identities to 

25 known proteins were revealed. 



58. Full-length PR011S2 Polypeptides 
The present invention provides newly identified and isolated nucleotidesequences encoding polypeptides 
referred to in die present application as PROl 152. In panicular, cDN A encoding a PROl 152 polypeptide has 
30 been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA5771 1-1501 clone was isolated from a human infant brain library. As far as is known, the 
DNA57711-1501 sequence encodes a novel factor designated herein as PROl 152; using die WU-BLAST-2 
sequence alignment computer program, no significant sequence identhies to any known proteins were revealed. 

35 59. Full-length PR01136 Polypeptides 

The present inventionprovides newly idemifiedand isolated nucleotide sequences encoding polypeptides 
referred to in die present application as PROl 136. In panicular, cDNA encoding a PROl 136 polypeptide has 
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been identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PROl 136 (shown in Figure 147 and SEQ ID NO:219) has amino acid sequence identity with 
PDZ domain-containing proteins. Accordingly, it is presently believed that PROl 136 disclosed in the present 
application is a newly identified member of the PDZ domain-containing protein family and may possess activity 
5 typical of that family. 

60. Full-length PR0813 PolvpeDtides 
The present inventionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0813. in particular. Applicants have identified and isolated cDNA 
10 encoding a PR0813 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs. Applicants foimd that the PROS 1 3 polypeptide has significant similarity 
to the pulmonary surfactant-associated protein C. Accordingly, it is presently believed that PROS 13 polypeptide 
disclosed in the present application is a newly identified pulmonary surfactant-associated protein C homolog. 

15 61. Full-length PRO809 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO809. In particular, Applicancs have identified and isolated cDNA 
encoding a PRO809 polypeptide, as disclosed in further detail in the Examples below. To Applicants present 
knowledge, the DNA57836-1338 nucleotide sequence encodes a novel factor. 

20 

62. FulMength PR0791 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0791. In particular, Applicants have identified and isolated cDNA 
encoding a PR079] polypeptide, as disclosed in further detail in the Examples below. To Applicants present 
25 knowledge, the DNA57838-1337 nucleotide sequence encodes a novel factor; however, using BLAST and FastA 
sequence alignment computer programs, there does appear to be some sequence identity with MHC-1 antigens, 
indicating that PR0791 may be related thereto in structure and function. 

63. Full-length PROl 004 Polypeptides 

30 The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PRO1004. In panicular, cDNA encoding a PRO1004 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

As far as is known, the DNA57844-1410 sequence encodes a novel factor designated herein as 
PRO1004. However, using WU-B1-AST2 sequence alignment computer programs, some sequence identities with 

35 known proteins were revealed. 
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64. Full-length PROIlll Polypeptides 

The present inveniion provides newly identified and isolated nucleotide sequences encoding polypq}tides 
referred to in the present application as PRO 1111. In particular, cDN A encoding a PRO 1111 polypeptide has 
been identified and isolate, as disclosed in further detail in the Examples l^low. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 
5 sequence PROl 1 1 1 (shown in Figure 157 and SEQ ID NO:229) has certain amino acid sequence identity with 
LIG. Accordingly, it is presently believed that PROIlll disclosed in the present application is a newly 
identified member of this glycoprotein family. 

65. Full-length PR01344 Polypeptides 

10 The present inventionprovides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present s^lication as PR01344. In panicular, cDNA encoding a PROI344 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PR01344 (shown in Figure 159 and SEQ ID NO:231) has certain amino acid sequence identity 

IS with the factor C protein of Carcinoscorpius rotundicauda. Accordingly, it is presently believed that PR01344 
disclosed in the present implication is a newly identified factor C protein and may possess activity typical of that 
protein. 

66. Full-length PROl 109 Polypeptides 

20 The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PRO! 109. In particular, cDNA encoding a PROl 109 polypeptide has 

been identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a fuil-lengdi 

native sequence PROl 109 (shown in Figure 161 and SEQ ID NO:236) has certain ammo acid sequence identity 
25 with the human UDP-Gal:GlcNAc galactosyltransferase protein. Accordingly, it is presently believed that 

PRO 1109 disclosed in the present application is a newly identified P-galactosyltransferase enzyme and has 

activity typical of those enzymes. 

67. FulHength PR01?83 Polypeptides 

30 The present invention provides newly identified and isolatednucleotidesequences encoding polypeptides 

referred to in the present application as PR01383. In particular, cDNA encoding a PR01383 polypeptide has 

been idemified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a fiilMength 

native sequence PR01383 (shown in Figure 163 and SEQ ID NO:241) has certain amino acid sequence identity 
35 with the putative human transmembrane protein nmb precursor (NMB_HUMAN). Accordingly, it is presently 

believed that PR01383 disclosed in the present application is a newly identified nmb homolog. 
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68. Full-length PRO1003 Polypeptides 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present apphcation as PRO 1003. In panicular. Applicants have identified and isolated cDNA 
encoding a PRO1003 polypeptide, as disclosed in further deuil in the Examples below. The PRO1003-encoding 
clone was isolated from a human breast tumor tissue library. The PRO1003-encoding clone was isolated using 
5 a trapping technique which selects for nucleotide sequences encoding secreted proteins. Thus, the PRO 1003- 
encoding clone may encode a secreted factor. To Applicants present knowledge, die UNQ487 (DNA58846- 
1409) nucleotide sequence encodes a novel factor; using BLAST and Fast A sequence alignment computer 
programs, no sequence identities to any known proteins were revealed. 

10 69. Full-length PRO1108 Polypeptides 

The present invention provides newly identified and isolated nucleotidesequences encoding polypeptides 
referred to in the present application as PROl 108. In panicular« Applicants have identified and isolated cDNA 
encoding a PROl 108 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs, Applicants foimd that the PROl 108 polypeptide has significant 

15 similarity to the LPAAT protein. Accordingly, it is presently believed that PROl 108 polypeptide disclosed in 
the present plication is a newly identified LPAAT homolog. 

70. Full-length PROIB? Polypeptides 

The present invention provides newly identifiedand isolated nucleotide sequences encoding polypeptides 
20 referred to in the present application as PROl 137. In panicular, Applicants have identified and isolated cDNA 
encoding a PRO 11 37 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs. Applicants found that the PRO 11 37 polypeptide has sequence 
identity with ribosyltransferases. Accordingly, it is presently believed that PROl 137 polypeptide disclosed in 
the present application is a newly identified member of the ribosyltransferase family and possesses 
25 ribosyltransferase activity. 



71. FulMeneth PROl 138 Polypeptides 

The present invention provides newly identified and isolated nucleotidesequences encoding polypeptides 
referred to in the present application as PROl 138. In panicular. Applicants have identified and isolated cDNA 
30 encoding a PRO 11 38 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs. Applicants found that the PROl 138 polypeptide has sequence 
identity with CD84 leukocyte antigen. Accordingly, it is presently believed that PROl 138 polypeptide disclosed 
in the present application is a newly identified member of the Ig superfamily and has activity typical of other 
members of the Ig superfamily. 
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72. Full-igngth PRO1054 Polypeptides 
The present invention provides newly identified and isolated nucleotide sequences encoding poIypq)Udes 
referred to in the present application as PRO1054. In particular. cDNA encoding a PRO1054 polypeptide has 
been identified and isolated, as disclosed in further detail in die Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
5 native sequence PROI054 (shown in Figure 174 and SEQ ID NO:256) has amino acid sequence identity with 
one or more of the major urinary proteins. Accordingly, it is presenUy believed that PRO10S4 disclosed in the 
present application is a newly identified member of the MUP family and may possess activity typical of that 
family. 

10 73. Full-length PR 0994 Pdlvpcptides 

The present invention provides newly identified and isolated nucleotidesequences encodmg polypeptides 
referred to in the presem application as PR0994. In panicular, cDNA encoding a PR0994 polypeptide has been 
identified and isolated, as disclosed in further detail in die Examples below. 

Using die WU-BLAST2 sequence alignment computer program, it has been found diat a fiill-lengdi 
15 native sequence PR0994 (shown- in Figure 176 and SEQ ID NO:258) has amino acid sequence identity widi die 
tumor-associated antigen L6. Accordingly, it is presemly believed diat PR0994 disclosed in die present 
application is a newly identified L6 antigen homolog. 

74. FulMeneth PR0812 Polvpeotides 

20 The present invention provides newly identified and isolated nucleotide sequences encodingpolypeptides 

referred to in the present application as PR0812. In panicular, cDNA encoding a PR0812 polypeptide has been 

identified and isolated, as disclosed in further detail in die Examples below. 

Using die WU-BLAST2 sequence alignment computer program, it has been found diat a fiill-lengdi 

native sequence PR08I2 (shown in Figure 178 and SEQ ID NO:260) has amino acid sequence identity widi die 
25 prostatic steroid-binding cl protein. Accordingly, it is presently believed dial PR0812 disclosed in die present 

application is a newly identified prostatic steroid-binding cl protein homolog. 

75. FulMeneth PRO1069 Polvoeptides 

The present invemionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
30 referred to in die present application as PRO1069. in particular. Applicants have identified and isolated cDNA 
encoding a PROI069 polypeptide, as disclosed in funher detail in die Examples below. Using BLAST and 
FastA sequence alignment computer programs, it was found diai die PRO 1069 polypeptide has sequence identity 
widi CHIF. Accordingly, it is presendy believed dial PRO 1069 polypeptide disclosed in die present application 
is a newly identified CHIF polypeptide and is involved in ion conductance or regulation of ion conductance. 

35 
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76. FulUlength PR01129 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO! 129. In panicular. Applicants have identified and isolated cDNA 
encoding a PROl 129 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FasiA sequence alignment computer programs. Applicants foimd that the PROl 129 polypeptide has significant 
5 similarity to the cytochrome P-450 family of proteins. Accordingly, it is presently believed that PROl 129 
polypeptide disclosed in the present application is a newly identified member of the cytochrome P'4S0 family 
and possesses activity typical of that family. 

77. Full-length PRO1068 PoWDCDtldes 

10 The present inventionprovides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PRO1068. In panicular, cDNA encoding a PRO1068 polypeptide has 

been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been foimd that a full-length native 

sequence PRO1068 has amino acid sequence identity with urotensin. Accordingly, it is presently believed that 
15 PRO1068 disclosed in the present application is a newly identified member of the urotensin family and may 

possess activity typical of the urotensin family. 

78. Full-length PROI066 Poivpeptides 

The present invention provides newly ideniifiedand isolated nucleoiide sequences encoding polypeptides 
20 referred to in the present application as PRO1066. In particular, Applicants have identified and isolated cDNA 
encoding a PRO1066 polypeptide, as disclosed in fiirther detail in the Examples below. The PRO1066-encoding 
clone was isolated from a human pancreatic mmor tissue library using a trapping technique which selects for 
nucleotide sequences encoding secreted proteins. Thus, the PRO1066-encoding clone may encode a secreted 
factor. To Applicants present knowledge, the DNA59215-1425 nucleotide sequence encodes a novel factor; 
25 using BLAST and FastA sequence alignment computer programs, no sequence identities to any known proteins 
were revealed. 

79. Full-length PROl 184 Polypeptides 

The present inventionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
30 referred to in the present application as PROl 184. In panicular. Applicants have identified cDNA encoding a 
PROl 184 polypeptide, as disclosed in further detail in the Examples below. To Applicants present knowledge, 
the DNA59220-1514 nucleotide sequence encodes a novel secreted factor. 

80. Full-length PROI360 Polypeptides 

3 5 The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PRO1360. In particular, cDNA encoding a PRO1360 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
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As far as is known, the DNA59488-1603 sequence encodes a novel factor designated herein as 
PRO1360; using WU-BLAST2 sequence alignment computer programs, no significant sequence identities to any 
known proteins were revealed. Some sequence identities were revealed, as indicated below in the examples. 

81. FuiMength PRO1029 Polypeptides 

5 The present invention provides newly identified and isolated nucleotide sequencesencodingpolypeptides 

refeired to in the present application as PROi029. In particular, cDNA encoding a PRO1029 polypeptide has 
been identified and isolated, as disclosed in further detail m the Examples below. 

The DNAS9493-1420 clone was isolated fit)m a human fetal liver spleen library using a trapping 
technique which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA59493-1420 clone 
10 does encode a secreted factor. As far as is known, the DNA59493-1420 sequence encodes a novel factor 
designated herein as PRO 1029; using the WU-BLAST2 sequence alignment computer program, no sequence 
identities to any known proteins were revealed. 

82. Full-length PR01139 Polypeptides 

15 The present invention pi:ovides newly identified and isolatednucleotide sequences encoding polypeptides 

referred to in the present application as PROl 139. In particular. Applicants have identified and isolated cDNAs 
encoding PROl 139, as disclosed in further detail in the Examples below. Using BLAST and FastA sequence 
alignment computer programs. Applicants found that the human PROl 139 protein originally identified exhibits 
a significant sequence homology to the a OB receptor associated protein HS0BRGRP_1, described by BaiUeul 

20 et al.. Nucleic Acids Res. 25. 2752-2758 (1997) (EMBL Accession No: Y12670). 

83. FulMength P|tO1309 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1309. In panicular, cDNA encoding a PROI309 polypeptide has 
25 been identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PROI309 (shown in Figure 196 and SEQ ID NO:278) has certain amino acid sequence identity 
with a protein designated KIAA0416, given the Dayhoff designation AB(X)7876_1. Moreover. PRO1309 has 
leucine rich repeats, accordingly, it is presently believed that PRO1309 disclosed in the present application is 
30 a newly identified member of the leucine rich protein family and may be involved in protein protein interactions. 

84. Full-length PRO1028 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1028. In panicular. Applicants have identified and isolated cDNA 
35 encoding a PRO1028 polypeptide, as disclosed in further detail in the Examples below. To Applicants present 
knowledge, the DNA59603-1419 nucleotide sequence encodes a novel factor. BLAST and FastA sequence 
alignment computer programs showed some sequence identity with proteins such as those designated ASSOSO" 
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85. FulMen^h PRO1027 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encodingpolypeptides 
referred to in the present application as PRO1027. In panicular. Applicants have identified and isolated cDNA 
5 encoding a PRO1027 polypeptide, as disclosed in further detail in the Examples below. The PRO1027-€ncoding 
clone was identified in a human uterine cervical tissue library. To Applicants present knowledge, the 
DNA59605-1418 nucleotide sequence encodes a novel factor. 



86. Full-length PROH07 Poivpeptides 

10 The present inventionprovides newly identified and isolated nucleotide sequences eircoding polypeptides 

referred to in the present application as PROl 107. In panicular. Applicants have identified and isolated cDNA 
encoding a PROl 107 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs, Applicants found that the PROl 107 polypeptide has some 
similarity to the PC>1 protein, human insulin receptor tyrosine kinase inhibitor, an alkaline phosphodiesterase, 

15 and autotaxin. Accordingly, it is presently believed that PROl 107 polypeptide disclosed in the present 
application is a newly identified member of (he phosphodiesterase family. 



87. Full-length PRO1I40 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding novel mulii- 
20 span transmembrane polypeptides referred to in the present application as PROl 140. In particular. Applicants 
have identified and isolated cDNA encoding a PROl 140 polypeptide, as disclosed in further detail in the 
Examples below. Using BLAST and FastA sequence alignment computer programs, some sequence identity with 
known proteins was found. 



25 88, FulHeneth PRO1106 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequencesencoding polypeptides 
referred to in the present application as PROl 106. In particular. Applicants have identified and isolated cDNA 
encoding a PROl 106 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and 
FastA sequence alignment computer programs, Applicants found that the PROl 106 polypeptide has significant 

30 similarity to the peroxisomal calcium-dependent solute carrier. . Accordingly, it is presently believed that 
PRO 1106 polypeptide disclosed in the present application is a newly identified member of the mitochondrial 
carrier superfamily and possesses transporter activity typical of this family. 



89. Full-length PR01291 Polypeptides 
35 The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PR01291 . In panicular, cDNA encoding a PR01291 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
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Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PR01291 (shown in Figure 208 and SEQ ID NO:291) has cenain amino acid sequence identity 
with the butyrophilin protein. Accordingly, it is presently believed that PR01291 disclosed in the present 
application is a newly identified butyrophilin homoiog and may possess activity typical of that protein. 

5 90. Full-length PRO1105 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROllOS. In particular. Applicants have identified cDNA encoding a 
PROI 105 polypeptide, as disclosed in further detail in the Examples below. To Applicants present knowledge, 
the DNA5%12-I466 nucleotide sequence encodes a novel factor. There is, however, some sequence identity 
10 with a peroxydase precursor designated in a Dayhoff database as *'ATTS1623_1 **. 

91. Full-len^h PROSll Polypeptides 

The present invention provides newly identified and isolatednucleotide sequences encoding polypeptides 
referred to in the present application as PROSll. In particular. Applicants have identified and isolated cDNA 
15 encoding a PROS 1 1 polypeptide, as disclosed in further deuil in the Examples below. The PROS 1 1 -encoding 
clone was isolated from a human colon tissue library. To Applicants present knowledge, the DNAS9613-1417 
nucleotide sequence encodes a novel factor; using BLAST and FasiA sequence alignment computer programs, 
sequence identities with RoBo-l, phospholipase inhibitors and a protein designated as •*SSC20F10_r were 
revealed, indicated that PROSl 1 may be related to one or more of these proteins. 

20 

92. FulMength PRO1I04 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequencesencodingpolypeptides 
referred to in the present application as PROI 104. In particular. Applicants have identified and isolated cDNA 
encoding a PROI 104 polypeptide, as disclosed in further deuil in the Examples below. To Applicants present 
25 knowledge, the DNAS9616-1465 nucleotide sequence encodes a novel factor; using BLAST and FastA sequence 
alignment computer programs, some sequence identity appeared with proteins designated as *'AB002107_1", 
-AF022991_^r and •'SP%^DICDr. 

93. FulNenirth PROllOO Polypeptides 

30 The present invention provides newly identified and isolatednucleotide sequences encoding polypeptides 

referred to in the present application as PROI 100. In particular. Applicants have identified cDNA encoding a 
PROI 100 polypeptide, as disclosed in further detail in the Examples below. To Applicants present knowledge, 
the DNAS9619-1464 nucleotide sequence encodes a novel factor; using BLAST and FastA sequence alignment 
computer programs, only some sequence identity with known proteins was revealed. There is some sequence 

35 identity with the yeast hypothetical 42.S KD protein in TSMl-AREl intergenic region (ACCESSION 
NO: 1404%), designated -YSCT4^YEAST-. 
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94. Fuli^length PR0836 Polypeptides 

The present inveniion provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0836. In particular. Applicants have identified and isolated cDNA 
encoding a PR0836 polypeptide, as disclosed in further detail in the Examples below. To Applicants present 
knowledge, the DNA59620'1463 nucleotide sequence encodes a novel factor. Using BLAST and FastA 
5 sequence alignment computer programs, there apj^ars to be some sequence identity with SLSl . 

95. FulMength PR01141 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 141 . In panicular, cDNA encoding a PROl 141 polypeptide has 
10 been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA59625-1498 clone was isolated torn a human ileum tissue library. As far as is known, the 
DNA59625-1498 sequence encodes a novel factor designated herein as PR01141; using the WU-BLAST2 
sequence alignment computer program, no sequence identities to any known proteins were revealed. 

15 96. Full-length PROl 132 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 132, In particular, cDNA encoding a PROl 132 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer program, it has been foimd that a full-length native 
20 sequence PROl 132 (shown in Figure 226 and SEQ ID NO: 309) has certain amino acid sequence identity with 
enamel matrix serine proteinase 1 and neuropsin. Accordingly, it is presently believed that PRO 11 32 disclosed 
in the present application is a newly identified member of the serine protease family and may possess protease 
activity typical of this family. 

25 97. Full-length PR01346 Polypeptides 

The present invention provides newly identified and isolated nucleotidesequences encoding polypeptides 
referred to in the present application as NL7 (UNQ701). In particular, cDNA encoding an NL7 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

As disclosed in the Examples below, a clone DNA59776-1600 has been deposited with ATCC. The 
30 acmal nucleotide sequence of the clone can be readily determined . by the skilled artisan by sequencing of the 
deposited clone using routine methods in the an. The predicted amino acid sequence can be determined from 
the nucleotide sequence using routine skill. For the NL7 (PR01346) herein. Applicants have identified what 
is believed to be the reading frame best identifiable with the sequence information available at the time of filing. 
Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 
35 sequence NL7 (shown in Figure 228 and SEQ ID NO:314) has cenain amino acid sequence identity with 
microfibril-associaied glycoprotein 4 (MFA4_HUMAN);ficolin-A - Mus musculus (AB007813_1); human lectin 
P35 (D63155S6_1); ficolin B - Mus musculus (AFO063217 1); human lenascin-R (restriction) (HS518E13_I); 
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the long fonn of a rat janusin precursor (A45445); fibrinogen-reiated protein HFREP-1 precursor (JN0596); 
a human Tenascin precursor (TENA HUMAN); human CDT6 (HSY16132_1): and angiopoieiin-1 - Mus 
musculus (MMU83509_l). It is presently believed that NL7 disclosed in the present application is a novel TIE 
ligand homologue, and may play a role in angiogenesis and/or vascular maintenance and/pr wound healing 
and/or inflanunation and/or mmor development and/or growth 

5 

98. FulMength PROI131 Polvpeptides 
The present invention provides newly identified and isolated nucleotide sequencesencoding polypeptides 
referred to in the present 24>plication as PR01131. In panicular, cDNA encoding a PR01131 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
10 Using WU-BLAST2 sequence alignment computer programs, it has been found that a fiilMength native 

sequence PR01131 (shown in Figure 230 and SEQ ID NO:319) has certain amino acid sequence identity with 
a lectin-Iike oxidized LDL receptor. Accordingly, it is presently believed that PROl 131 disclosed in the present 
application may have at least one mechanism similar to those of the LDL receptors. 

15 99. Full-length PR01281 Poivpeptidcs 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0128L In panicular, cDNA encoding a PR01281 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA59820-1549 clone was isolated from a human fetal liver library using a trapping technique 
20 which selects for nucleotide sequences encoding secreted proteins. Thus, as far as is known, the DNA59820- 
1549 sequence encodes a novel factor designated herein as PRO 128 1 . Using WU-BLAST2 sequence alignment 
computer programs, some sequence identities to known proteins was found, but determined not to be significant. 

100. f ulNength PyO1064 Polypeptides 

25 The presem invemionprovidesnewly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PRO1064. In particular, cDNA encoding a PRO1064 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA59827-I426 clone was isolated from a human fetal kidney library. As far as is known, the 
DNA59827-1426 sequence encodes a novel factor designated herein as PRO1064; using die WU-BLAST2 

30 sequence aligiunem computer program, no significant sequence identities to any known proteins were revealed. 

101. FulNength PR01379 Polypeptides 

The presem invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO 1379. In panicular, cDNA encoding a PRO 1379 polyi)eptide has 
35 been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA59828 clone was isolated ft-om a human fetal kidney library. As far as is known, the 
PRO 1379 polypeptide encoded thereby is a novel secreted factor. Using WU-BLAST2 sequence alignment 
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computer programs, sequence identity was found between PR01379 and a hypothetical yeast protein 
"YHYS YEAST" (Dayhoff database; version 35.45 SwissProt 35), particularly at the C-ierminal ends. 
Sequence homologies with other known proteins were revealed, but determined not to be significant. 

102. FulHength PR0844 Polypeptides 

5 The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PR0844. In panicular. Applicants have identified and isolated cDNA 
encoding a PR0844 polypeptide, as disclosed in further deuil in the Examples below. Using BLAST and FastA 
sequence aligiunent computer programs. Applicants found that the PR0844 polypeptide has sequence identity 
with serine protease inhibitors. Accordingly, h is presently believed that PR0844 polypeptide disclosed in the 
10 present application is a newly identified serine protease inhibitor and is capable of inhibiting serine proteases. 

103. FulHen^h FR0848 PoivDCPtides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR0848. In panicular. Applicants have identified and isolated cDNA 
IS encoding a PR0848 polypeptide, as disclosed in further detail in the Examples below. Using BLAST and FastA 
sequence alignment computer programs. Applicants found that the PR0848 p)olypepiide has sequence identity 
with sialyltransferases. Accordingly, it is presently believed thai PR0848 polypeptide disclosed in the present 
application is a newly identified member of the sialyltransferase family and possesses sialylation capabilities as 
typical of this family. 

20 

104. Full-length PRO1097 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequencesencodingpolypeptides 
referred to in the present application as PROI097. In particular. Applicants have identified and isolated cDNA 
encoding a PRO 1097 polypeptide, as disclosed in further detail in the Examples below. To Applicants present 
25 knowledge, the DNA59841-1460 nucleotide sequence encodes a novel factor. Using BLAST and FastA 
sequence alignment computer programs, some sequence identity with proteins designated as **CELK05G3_3", 
**CRU26344_r, "SPBC16C6_8". "P_W13844" and -AF013403' was revealed. 

105. Full-length PR01153 Polypeptides 

30 The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PROl 153. In panicular, cDNA encoding a PROl 153 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 
sequence PROl 153 (shown in Figure 246 and SEQ ID NO:351) has cenain amino acid sequence identity with 

35 HPBRII-7 protein submitted to the EMBL Data Library June 1992. Accordingly, it is presently believed that 
PROl 153 disclosed in the present application may be related to HPBRn-7. 
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106. FulMength PR011S4 Polvpep tidiHs 

The present invention provides newly identified and isolated nucleotide sequences encodingpolypepiidcs 
referred to in the present application as PROl 154. In particular. cDNA encoding a PROl 154 polypq)tide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a fulMengdi native 
5 sequence PROl 154 (shown in Figure 248 and SEQ ID Nb:353) aligns with a KIAA0525 protein, designated 
AB011097. PROl 154 has a novel N-ienninus of 73 amino acids. Accordingly, PROl 154 is believed to be 
novel. PROl 154 also has significant sequence identity with aminopeptidase N, insulin-regulated membrane 
aminopeptidase, ihrotropin-releasing hormone degrading enzyme and placental leucine aminopeptidase. 
Therefore, PROl 154 is believed to be a novel aminopeptidase. or peptide which degrades peptides. 

10 

107. FulMength PROl 181 PolvngptidiHs 

The preseminveniionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present applicauon as PROl 181 . In panicular. cDNA encoding a PROl 181 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
15 The DN A59847-151 1 clone was isolated from a human prostate tissue library using a trapping technique 

which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA59847-151I clone does 
encode a secreted factor. As far as is known, the DNA59847- 1511 sequence encodes a novel factor designated 
herein as PROl 181; using the WU-BLAST2 sequence alignment computer program, no significant sequence 
identities to any known proteins were revealed. 

20 

108. FulMength PROl 182 PolvDeptidej; 

The present invention provides newly identified and isolated nucleotide sequences encodingpolypepiidcs 
referred to in the present application as PR01182. In panicular, cDNA encoding a PROl 182 polypeptide has 
been identified and isolated, as disclosed in ftirther detail in the Examples below. 
25 Using die WU-BLASt2 sequence alignment computer program, it has been found that a full-length 

native sequence PROl 182 (shown in Figure 252 and SEQ ID NO:357) has amino acid sequence identity with 
the congluiinin protein. Accordingly, it is presenUy believed that PRO 1 1 82 disclosed in the present q)pIication 
is a newly identified conglutinin homolog. 

30 109. FulMength PR0115S Polvngntide^ 

The preseminveniionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in die present application as PROl 155. In particular, cDNA encoding a PROl 155 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 
35 sequence PROl 155 (shown in Figure 254 and SEQ ID NO:359) has cenain amino acid sequence identity with 
neurokinin B. Accordingly, it is presently believed that PROl 155 disclosed in the present application is a newly 
identified member of the tachykinin family. 
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110. Full-length PR01156 Polypeptides 
The present invemion provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO! 156. In panicular, cDNA encoding a PROl 156 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA59853-1505 clone was isolated from an adult human heart library using a trapping technique 
5 which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA59853-1505 clone may encode 
a secreted factor. As far as is known, the DNA59853-1505 sequence encodes a novel factor designated herein 
as PRO 1156. However, using WU-BLAST2 sequence alignment computer programs, some sequence identity 
with known proteins were revealed. 

10 111. Full-length PRO1098 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PRO1098. In panicular. Applicants have identified cDN A encoding a 
PRO1098 polypeptide, as disclosed in further detail in the Examples below. The PRO1098-encoding clone was 
isolated from a human lung tissue library. To Applicants present knowledge, the DNA59854-1459 nucleotide 

15 sequence encodes a novel factor; using BLAST and FasiA sequence alignment computer programs, no significant 
sequence identities to any known proteins were revealed. Some sequence identity appeared with proteins such 
as the "Env** polyprotein and a methyltransferase. 

112. Full-length PROl 127 Polypeptides 

20 The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PROl 127. In particular. cDN A encoding a PROl 127 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA60283-1484 clone encodes a secreted factor. As far as is known, the DNA60283-1484 
sequence encodes a novel factor designated herein as PROl 127; using WU.BLAST2 sequence alignment 

25 computer programs, minimal sequence identities to any known proteins were revealed. 

113. Full-length PROl 126 Polypeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 126. In particular. cDN A encoding a PROl 126 polypeptide has 
30 been identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PROl 126 (shown in Figure 262 and SEQ ID NO: 367) has certain amino acid sequence identity 
with the olfactomedin protein. Accordingly, it is presently believed that PROl 126 disclosed in the present 
application is a newly identified olfactomedin homolog and may possess activity typical of that protein. 

35 
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114. FulHength PR0112S PolvDCPtides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 125. In panicular, cDNA encoding a PROl 125 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 
5 sequence PROl 125 (shown in Figure 264 and SEQ ID NO:369) has certain amino acid sequence identity with 
transcriptional repressor rco-1. Accordingly, it is presently believed diat PR01125 disclosed in the present 
application is a newly identified member of the WD superfamily. 



115. Full-ien^h PRQil86 Polvpeotides 
10 The present inventionprovidesnewly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PR01186. In particular, cDNA encoding a PROl 186 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence aligmnent computer programs, it has been foimd that a full-length native 
sequence PROl 186 (shown in Figure 266 and SEQ ID NO:371) has amino acid sequence identity with venom 
IS protein A from Dendroaspis polylepsis polylepsis venom. Accordingly, it is presently believed that PROl 186 
disclosed in the present application is a newly identified member of venom protein A and may share a related 
mechanism. 



116. Full-length PROl 198 Polypeptides 
20 The present invention provides newly identified and isolated nucleotide sequences encodingpolypeptides 

referred to in the present application as PROl 198. In particular. cDNA encoding a PROl 198 polypeptide has 
been identified and isolated, as disclosed in further deuil in the Examples below. 

As far as is known, the DNA60622-1525 sequence encodes a novel factor designated herein as 
PROl 198. However, using WU-BLAST2 sequence alignment computer programs, some sequence identity with 
25 known proteins was found. 



117. Full-Ieneth PR01158 Polypeptides 
The presem invention provides newly identified and isolated nucleotidesequences encoding polypeptides 
referred to in the presem application as PROl 158. In panicular, cDNA encoding a PROl 158 polypeptide has 
30 been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA60625-1507 clone was isolated from a human lung tumor tissue library. As far as is known, 
the DNA60625-1507 sequence encodes a novel factor designated herein as PROl 158. However, using WU- 
BLAST2 sequence alignment computer programs, some sequence identities with known proteins were shown. 

35 118. FulMcngth PROl 159 Polvpepttdes 

The present invention provides newly identifiedand isolated nucleotide sequences encoding polypeptides 
referred to in the presem application as PROl 159. In panicular. cDNA encoding a PROl 159 polypeptide has 
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been identified and isolated, as disclosed in further detail in the Examples below. 

The DNA60627-1508 clone was isolated from a human peripheral blood granulocyte tissue library using 
a trapping technique which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA60627- 
1508 clone does encode a secreted factor. As far as is known, the DNA60627-1S08 sequence encodes a novel 
factor designated herein as PR011S9; using the WU>BLAST2 sequence alignment computer program, no 
5 sequence identities to any known proteins were revealed. 

119. Full-length PR01124 Polypeptides 

The present inventionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 124. In particular, cDNA encoding a PROl 124 polypeptide has 
10 been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a fulMength native 
sequence PRO 11 24 (shown in Figure 274 and SEQ ID NO: 377) has amino acid sequence identity with an 
epithelial chloride channel protein from bos taurus. PROl 124 also has sequence identity with ECAM-1. 
Accordingly, it is presently believed that PROl 124 disclosed in the present application is a newly identified cell 
15 membrane protein involved in communication of cells either through ion channels or cell adhesion molecules. 

120. FulMength PR01287 PolvDCPtides 

The present inventionprovides newly identified and isolated nucleoiide sequences encoding polypeptides 
referred to in the present application as PR01287. In panicular, cDNA encoding a PR01287 polypeptide has 
20 been identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a full-length 
native sequence PRO 1287 (shown in Figure 276 and SEQ ID NO:381) has amino acid sequence identity with 
the radical fringe protein from Callus gallus (GGU82088_1 ). Accordingly, ii is presently believed that PRO 1287 
disclosed in the present application is a newly identified fringe protein homolog and may possess activity typical 
25 of the fringe protein. 

121. Full-length PR013?2 PolYpeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR01312. In panicular, cDNA encoding a PR01312 polypeptide has 
30 been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, some sequence identities with known 
proteins were revealed, but were determined not to be significant. Therefore, as far as is known, the 
DNA61873-1574 sequence encodes a novel transmembrane protein designated herein as PR01312. 

35 122. FulMength PROl 192 Polypeptides 

The present invention provides newly identifiedand isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 192. in panicular, cDNA encoding a PROl 192 polypeptide has 
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been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 
sequence PRO! 192 (shown in Figure 280 and SEQ ID NO:389) has amino acid sequence identity with trout PO- 
like glycoprotein (GEN12838 IPl). Accordingly, it is presenUy believed that PROl 192 disclosed in the present 
application is a newly identified member of the myelin PO glycoprotein family. 

5 

123. Full-length PROil60 Polvpeptides 

The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 160. In particular. cDNA encoding a PROl 160 polypeptide has 
been identified and isolated, as disclosed m fiinher detail m the Examples below. 

10 The DN A62872- 1509 clone was isolated from a human breast tissue library using a trapping technique 

which selects for nucleotide sequences encoding secreted proteins. Thus, the DNA62872-1509 clone does 
encode a secreted factor. As far as is known, the DNA62872-1509 sequence encodes a novel factor designated 
herein as PRO1I60: using the WU-BLAST2 sequence alignment computer program, no significant sequence 
identities to any laK>wn proteins were revealed. 

15 . 

124. FulNeneth PR01187 FolvDeDtides 

The present invention provides newly idcntifiedand isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PROl 187. In panicular, cDNA encoding a PROl 187 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
20 As far as is known, the DNA62876-1517 sequence encodes a novel factor designated herein as 

PR01187; using WU-BLAST2 sequence alignment computer programs, no significant sequence identities to any 
known proteins were revealed. 

125. FuIMenfith PR01185 Polvi>eDtides 

25 The present invemionprovidesnewly identified and isolated nucleotide sequences encoding polypeptides 

referred to m the present application as PR01185. In particular, cDNA encoding a PROl 185 polypeptide has 
been identified and isolated, as disclosed in further detail m the Examples below. 

As far as is known, the DNA62881-1515 clone encodes a novel factor designated herein as PROl 185; 
using WU-BLAST2 sequence alignment computer programs, no significant sequence identities to any known 

30 proteins were revealed. 

126. FulNenrth PR01345 Poivpeptidi^^ 

The present invention provides newly identified and isolated nucleotide sequences encoding polype{nides 
referred to in the present application as PR01345. In particular. cDNA cncodmg a PR01345 polypeptide has 
35 been identified and isolated, as disclosed in further detail in the Examples below. 

Using the WU-BLAST2 sequence alignment computer program, it has been found that a fulMengih 
native sequence PR01345 (shown in Figure 288 and SEQ ID NO:403) has amino acid sequence identity with 
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the C-iypeleainhomolog precursor protein of bos taunis(BTU22298_l). Accordingly, it is presenUy believed 
that PR01345 disclosed in the present application is a newly identified member of the C-type lectio protein 
family and may possess activity typical of that family or of the tetraneain protein in particular. 

127. Full-length PR0124S Polv peptidP« 

The present invention provides newly identified and isolated nucleotide sequencesencodii«polypeptides 
referred to in the present application as PR01245. In panicular. cDNA encoding a PR01245 polypeptide has 
been idemified and isolated, as disclosed in further detail in the Examples below. 

The DNA64884-1527 clone was identified using methods that selects for nucleotide sequences encoding 
secretedproteins. As far as is known, the DNA64884-1527 sequence encodes a novel secreted factor designated 
herein as PR01245. Using WU-BLAST2 sequence aligmnem computer programs, some sequence identhies to 
known proteins were revealed; however, it was determined that they were not significant. 

128. FulHenrth PR013S8 P«lvnPntiH« 

The present invemionprovides newly identified and isolated nucleotide sequences encoding polypeptides 
referred to in the present application as PR01358. In particular. cDNA encoding a PR01358 polypeptide has 
been identified and isolated, as disclosed in further ctetail in the Examples below. 

Using WU-BLAST2 sequence alignmem computer programs, it has been found that a fiill-length mitive 
sequence PR01358 (shown in Figure 292 and SEQ ID N0:4 10) has amino acid sequence identity with RASP-1 . 
Accordingly, i, is presently believed that PR01358 disclosed in the present application is a newly identified 
member of the seipin family of serine protease inhibitors and may possess serine protease inhibition activity, 
protein catabolism inhibitoiy activity and/or be associated with regeneration of tissue. 

129. FulHength PROl 19S Pftly ppptiri..c 

The presem invention provides newly idemified and isolated nucleotide sequencesencodingpolypeptides 
referred to in the presem application as PROl 195. In particular. cDNA em:oding a PROl 195 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment compmer programs, it has been found that a fiill-length native 
sequence PROl 195 (shown in Figure 294 and SEQ ID NO:412) has amim, acid sequence identity with 
MMU28486_1, termed a proline rich acidic protein from Mus musculus. locus MMU28486. Accession: 
U28486. database GBTRANS, submitted 06-JUN-1995 by John W. Kasik. Accordingly, it ispresently believed 
that PROl 195 disclosed in the present application is a newly idemified member of this protein family. 



130. Full-length PRO1270 Pnlv ppp»irip« 
The present inveraion provides newly idenufiedand isolated nucleotide sequences encoding polypeptides 
35 referred to in the presem application as PRO1270. in panicular. cDNA encoding a PRO1270 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
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Using ihe WU-BLAST2 sequence alignment computer program, it has been found tiiat a ftilMengih 
native sequence PRO1270 (shown in Figure 296 and SEQ ID NO:414) has amino acid sequence identity with 
the lectin protein (XLU86699_l) of Xenopus laevis. Accordingly, it is presently believed that PROI270 
disclosed in the present application is a newly identified member of the lectin protein family and may possess 
activity typical of that family. 

5 

131. Full-length PR01271 PolvpeDtides 
The present mvemion provides newly identified and isolated nucleotide sequencesencoding polypeptides 
referred to in the present application as PR0127I. In particular, cDNA encoding a PR01271 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
10 As far as is known, the DNA66309-1538 sequence encodes a novel factor designated herein as 

PR0127 1 ; using WU-BLAST2 seqi^nce alignment computer programs, no significant sequence identities to any 
known proteins were revealed (results further described in the examples below). 



132. FulHength PR0137S Polypeptides 

15 The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 

referred to in the present application as PR01375. In panicular, cDNA encoding a PR01375 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 

Using WU-BLAST2 sequence alignment computer programs, it has been found that a full-length native 
sequence PRO 1375 (shown in Figure 300 and SEQ ID NO:418) has amino acid sequence identity PUT2. 

20 Accordingly, it is presently believed that PR01375 disclosed in the present application has at least one related 
mechanism of PUT2. 



133. Full-length PR01385 PolvDeptides 
The present invention provides newly identified and isolated nucleotide sequencesencodingpolypeptides 
25 referred to in the present application as PROI385. In panicular, cDNA encoding a PR01385 polypeptide has 
been ideiuified and isolated, as disclosed in further detail in die Examples below. 

The DNA68869-1610 clone was isolated from a htunan tissue library using a trapping technique which 
selects for nucleotide sequences encoding secreted proteins. Thus, the DNA68869-1610 clone does encode a 
secreted factor. As far as is known, the DNA68869-16I0 sequence encodes a novel factor designated herein as 
30 PROI385; using (he WU-BLAST2 sequence alignment computer program, no significant sequence identities to 
any known proteins were revealed. 



134. FulMeneth PR01387 Polypeptides 
The present invention provides newly identified and isolated nucleotide sequences encoding polypeptides 
35 referred to in the present application as PR01387. In panicular, cDNA encoding a PR01387 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
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Using the WU-BLAST2 sequence alignment computer program, it has been found that a fiilMength 
native sequence PR01387 (shown in Figure 304 and SEQ ID NO:422) has amino acid sequence identity with 
the myelin pO protein protein precursor (MYPO_HETFR). Accordingly, it is presently believed that PR01387 
disclosed in the present application is a newly identified member of the myelin protein family and may possess 
activity typical of that family. 

5 

135. FulMeneth PR01384 Polvpeptides 
The present invention provides newly identified and isolated nucleoiide sequences encoding polypeptides 
referred to in the present application as PR01384. In panicular. cDNA encoding a PR01384 polypeptide has 
been identified and isolated, as disclosed in further detail in the Examples below. 
10 Using WU-BLAST2 sequence alignment computer programs, it has been found that a fiiil-length native 

sequence PR01384 (shown in Figure 306 and SEQ ID NO:424) has amino acid sequence identity with NKG2-D 
(AF054819_1; Dayhoff database, version 35.45 SwissProt 35). Accordingly, u is presently believed that 
PR01384 disclosed in the present application is a newly identified member of the NKG2 family and may possess 
MHC activation/inactivation activities Qrpical of the NKG2 family. 

15 

B. PRO Variants 

In addition to the full-length native sequence PRO polypeptides described herein, it is contemplated that 
PRO variants can be prepared. PRO variants can be prepared by introducing appropriate nucleotide changes into 
the PRO DNA, and/or by synthesis of the desired PRO polypeptide. Those skilled in the an will appreciate that 

20 amino acid changes may alter post-translaiional processes of the PRO, such as changing the number or position 
of glycosylation sites or altering the membrane anchoring characteristics. 

Variations in the native full-length sequence PRO or in various domains of the PRO described herein, 
can be made, for example, using any of the techniques and guidelines for conservative and non-conservative 
mutations set fonh, for instance, in U.S. Patent No. 5,364,934. Variations may be a substitution, deletion or 

25 insertion of one or more codons encoding the PRO that results in a change in the amino acid sequence of the 
PRO as compared with the native sequence PRO. Optionally the variation is by substimtion of at least one amino 
acid with any other amino acid m one or more of the domains of the PRO. Guidance in determining which 
amino acid residue may be inserted, substimted or deleted without adversely affecting the desired activity may 
be found by comparing the sequence of the PRO with that of homologous known protein molecules and 

30 minimizing the number of amino acid sequence changes made in regions of high homology. Amino acid 
substimtions can be the result of replacing one amino acid with another amino acid having similar strucwral 
and/or chemical properties, such as the replacement of a leucine with a serine, i.e., conservative amino acid 
replacements. Insenions or deletions may optionally be in the range of about 1 to 5 amino acids. The variation 
allowed may be determined by systematically making insenions, deletions or substitutions of amino acids in the 

35 sequence and testing the resulting variants for activity exhibited by the full-length or manire native sequence. 

PRO polypeptide fragments are provided herein. Such fragments may be truncated at the N-ierminus 
or C-terminus, or may lack internal residues, for example, when compared with a full length native protein. 
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Certain fragments lack amino acid residues that are not essential for a desired biological activity of the PRO 
polypeptide. 

PRO fragments may be prepared by any of a number of conventional techniques. E>esired peptide 
fragments may be chemically synthesized. An alternative approach involves generating PRO fragments by 
enzymatic digestion, e.g., by treating the protein with an enzyme known to cleave proteins at sites defined by 
5 particular amino acid residues, or by digesting the DNA with suitable restriction enzyme and isolating the 
desired fragment. Yet another suitable technique involves isolating and amplifying a DNA Augment encoding 
a desired polypeptide fragment, by polymerase chain reaction (PGR). Oligonucleotides that define the desired 
termini of the DNA fragment are employed at the 5' and 3* primers in the PGR. Preferably, PRO polypeptide 
fragments share at least one biological and/or immunological activity with the native PRO polypeptide disclosed 
10 herein. 

In particular embodiments, conservative substimtions of interest are shown in Table 1 under the heading 
of preferred substitutions. If such substitutions result in a change in biological activity, then more substantial 
changes, denominated exemplary substimtions in Table 1, or as further described below in reference to amino 
acid classes, are introduced and the products screened. 
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Table 1 

Original Exemplary Preferred 

Residue Substitutions Substitutions 

5 Ala (A) val; leu; ile vai 

Arg (R) lys; gin; asn |ys 

Asn (N) gin; his; lys; arg gin 

Asp (D) glu glu 

Cys (C) ser 

10 Gln(Q) asn 

Glu (£) asp 

Gly (G) pro; ala ala 

His (H) asn; gin; lys; arg arg 

He (I) leu; val; met; ala; phe; 

15 norleucine 

Leu (L) norleucine; ile; val; 



ser 
asn 
asp 



leu 



met; ala; phe iie 
Lys (K) arg; gin; asn 

Met (M) leu; phe; ile leu 

leu 



arg 



20 Phe (F) leu; val; ile; ala; lyr 

Pro (P) ala 

Ser (S) thr 

Thr (T) ser 

Tip (W) lyr; phe 

25 Tyr (Y) trp; phe; thr; ser 



ala 
thr 
ser 
tyr 
phe 



Val (V) ile; leu; met; phe; 

ala; norleucine leu 

Substantial modifications in ftmciionor immunological identity of the PRO polypeptide are accomplished 
30 by selecting substitutions that differ significantly in their effect on maintaining (a) the sinicmre of the polypeptide 

backbone in the area of the substitution, for example, as a sheet or helical conformation, (b) the charge or 

hydrophobiciiy of the molecule at the target site, or (c) the bulk of the side chain. Namrally occurring residues 

are divided into groups based on common side-chain propenies: 

(1) hydrophobic: norleucine, met, ala, val. leu, ile; 
35 (2) neutral hydrophilic: cys, ser, thr; 

(3) acidic: asp, glu; 

(4) basic: asn. gin, his, lys. arg; 

(5) residues that influence chain orientation: gly, pro; and 

(6) aromatic: trp. tyr, phe. 

40 Non^onservaiive substitutions will entail exchanging a member of one of these classes for another class. 

Such subsunited residues also may be introduced into the conservative substitution sites or, more preferably, into 
the remaining (non-conserved) sites. 

The variations can be made using methods known in the art such as oligonucleotide-mediaied (site- 
directed) mutagenesis, alanine scanning, and PGR mutagenesis. Site-directed mutagenesis [Carteret al.. Nucl. 

45 Acids Res., 13:4331 (1986); Zoller et al., NucL Acids Res . 10:6487 (1987)J. cassette mutagenesis fWells ei 
al.. Gene. 34:3 15 (1985)], restriction selection mutagenesis [Wells ei al., Philos. Trans. R. Soc. London Ser A 
317:415 (1986)] or other known techniques can be performed on the cloned DNA lo produce the PRO variant 
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DNA. 

Scanning amino acid analysis can also be employed to identify one or more amino acids along a 
contiguous sequence. Among the preferred scanning amino acids are relatively small, neutral amino acids. Such 
amino acids include alanine, glycine, serine, and cysteine. Alanine is typically a preferred scanning amino acid 
among this group because it eliminates the side-chain beyond the beia-carbon and is less likely to alter the main* 
5 chain conformation of the variant [Cunningham and Wells, Science. 244: 1081-1085 (1989)]. Alanine is also 
typically preferred because it is the most conunon amino acid. Further, it is frequently found in both buried and 
exposed positions [Creighton, The Proteins. (W.H. Freeman & Co., N.Y.); Chothia, J. MoL Biol.. 150 :1 
(1976)]. If alanine substitution does not yield adequate amounts of variant, an isoteric amino acid can be used. 

10 C. Modifications of PRO 

Covalem nuxlifications of PRO are included within the scope of this invention. One type of covalent 
modification includes reacting targeted amino acid residues of a PRO polypeptide with an organic derivatizing 
agent that is capable of reacting with selected side chams or the N- or C- terminal residues of the PRO. 
Derivatization with bifunctional agents is useful, for instance, for crosslinking PRO to a water-insoluble support 

IS matrix or surface for use in the method for purifying anti-PRO amibodies, and vice-versa. Commonly used 
crosslinking agents include, e.g., l.l-bis(diazoacetyl)-2-phenylethane. gluiaraldehyde, N-hydroxysuccinimide 
esters, for example, esters with 4-azidosalicylic acid, homobifiinciional imidoesters, including disuccinimidyl 
esters such as 3.3'-dithiobis(succinimidylpropionaie). bifunctional maleiraides such as bis-N-maleimido-1,8- 
octane and agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate. 

20 Other modifications include deamidation of glutaminyl and asparaginyl residues to the corresponding 

glutamyl and aspartyl residues, respectively, hydroxylation of proline and lysine, phosphorylation of hydroxyl 
groups of seryl or threonyl residues, methylaiion of the a-amino groups of lysine, arginine, and histidine side 
chains [T.E. Creighton, Proteins: Strucnire and Molecular Properties. W.H. Freeman & Co., San Francisco, 
pp. 79-86 (1983)], acetylation of the N-terminal amine, and amidation of any C-tcrminal carboxyl group. 

25 Another type of covalent modification of the PRO polypeptide included within the scope of this 

invention comprises altering the native glycosyiation panem of the polypeptide. "Altering the native 
glycosylation panem" is intended for purposes herein to mean deleting one or more carbohydrate moieties found 
in native sequence PRO (either by removing the underlying glycosylation site or by deleting the glycosylation 
by chemical and/or enzymatic means), and/or adding one or more glycosylation sites that are not present in the 

30 native sequence PRO. In addition, the phrase includes qualitative changes in the glycosylation of the native 
proteins, involving a change in the nature and proportions of the various carbohydrate moieties present. 

Addition of glycosylation sites to the PRO polypeptide may be accomplished by altering the amino acid 
sequence. The alteration may be made, for example, by the addition of, or substitution by, one or more serine 
or threonine residues to the native sequence PRO (for O-linked glycosylation sites). The PRO amino acid 

35 sequence may optionally be altered through changes at the DNA level, particularly by mutating the DNA 
encoding the PRO polypeptide at preselected bases such that codons are generated that will translate into the 
desired amino acids. 
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Another means of increasing the number of carbohydrate moieties on the PRO polypeptide is by 
chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are described in the an, e.g., 
in WO 87/05330 published 1 1 September 1987, and in Apiin and Wrision, CRC Crit. Rev. Biochem.. pp. 259- 
306 (1981). 

Removal of carbohydrate moieties present on the PRO polypeptide may be accomplished chemically 
5 or enzymatically or by mutational substitution of codons encoding for amino acid residues that serve as targets 
for glycosyiation. Chemical deglycosylation techniques are known in the an and described, for instance, by 
Hakimuddin, et al.. Arch. Biochem. Biophvs.. 259:52 (1987) and by Edge et al.. Anal. Biochem.. 118:131 
(1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the use of a variety 
of endo- and exo-glycosidases as described by Thotakura et al., Mcth. Enzvmol.. 138:350 (1987). 

10 Another type of covalent modification of PRO comprises linking the PRO polypeptide to one of a variety 

of nonproteinaceous polymers, e.g.. polyethylene glycol (PEG), polypropylene glycol, or polyoxyalkylenes, in 
the manner set forth in U.S. Patent Nos. 4,640,835; 4.496,689; 4,301,144; 4,670,417; 4,791, 192 or 4, 179,337. 

The PRO of the present invention may also be modified in a way to form a chimeric molecule 
comprising PRO fused to another, heterologous polypeptide or amino acid sequence. 

15 In one embodiment, such a chimeric molecule comprises a fusion of the PRO with a ug polypeptide 

which provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is generally placed 
at the amino- or carboxyl- terminus of the PRO. The presence of such epitope*tagged forms of the PRO can be 
detected using an antibody against the tag polypeptide. Also, provision of the epitope tag enables the PRO to 
be readily purified by affinity purification using an anti-tag antibody or another type of affinity matrix that binds 

20 to the epitope tag. Various tag polypeptides and their respective antibodies are well known in the an. Examples 
include poly -hist id ine (poly-his) or poly-hisiidine-glycine (poly-his-gly) tags; the flu HA lag polypeptide and its 
antibody 12CA5 (Field et al., Mol. Cell. Biol. . 8:2159-2165 (1988)); the c-myc tag and the 8F9, 3C7, 6E10, 
04, B7 and 9E10 antibodies thereto [Evanei al.. Molecular and Cellular Biology. 5:3610-3616 (1985)]; and the 
Herpes Simplex virus glycoprotein D (gD) tag and its antibody [Paborsk^ et al.. Protein Engineering. 3(6):547- 

25 553 (1990)]. Other tag polypeptides include the Flag-pepiide |Hopp et al., BioTechnoloev. 6:1204-1210 
(1988)]; the KT3 epitope peptide [Manin et al.. Science . 255:192-194 (1992)1; an a-mbulin epitope peptide 
[Skinner et al., J. Biol. Chem. . 266:15163-15166 (1991)]; and the T7 gene 10 protein peptide tag [Lutz- 
Freyermuth et al., Proc. Natl. Acad. Sci. USA. 87:6393-6397 (1990)J. 

In an alternative embodiment, the chimeric molecule may comprise a fusion of the PRO with an 

30 immunoglobulin or a particular region of an inmiunoglobulin. For a bivalent form of the chimeric molecule (also 
referred to as an ''inunimoadhesin*'), such a fusion could be to the Fc region of an IgG molecule. The Ig fusions 
preferably include the substitution of a soluble (transmembrane domain deleted or inactivated) form of a PRO 
polypeptide in place of at least one variable region within an Ig molecule. In a particularly preferred 
embodiment, the immunoglobulin fusion includes the hinge, CH2 and CH3, or the hinge, CHI, CH2 and CH3 

35 regions of an IgG 1 molecule. For the production of immunoglobulin fusions see also US Patent No. 5,428, 130 
issued June 27, 1995. 
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D. PrenararmnnfPPO 

The description below relaies primarily to production of PRO by culniring cells transformed or 
transfected with a vector containing PRO nucleic acid. It is, of course, contemplated that alternative methods, 
which are well known in the art, may be employed to prepare PRO. For instance, the PRO sequence, or 
portions thereof, may be produced by direct pepude synthesis using solid-phase techniques [see, e.g., Stewart 
5 Solid-Phase Peptide Synthesis, W.H. Freeman Co., San Francisco, CA (1969); Merrifield, J.Am.Chem. 

S^ 85:2149-2154 (1963)]. In vitro protein synthesis may be performed using manual techniques or by 
automation. Automated synthesis may be accomplished, for instance, using an Applied Biosysiems Peptide 
Symhesizer (Foster City, CA) using manufacmrer's instructions. Various ponions of the PRO may be 
chemically syndiesized separately and combined using chemical or enzymatic methods to produce the fiill-lengUi 
10 PRO. 
■ 

- 1. Isolation of DNA Encodinp PRO 
DNA encoding PRO may be obtained from a cDNA libraiy prepared from tissue believed to possess 
the PRO mRNA and to express it at a detectable level. Accordingly, human PRO DNA can be conveniently 
15 obtained from a cDNA library prepared from human tissue, such as described in the Exan^jles. The PRO- 
encoding gene may also be obtamed from a genomic libraiy or by known synthetic procedures (e.g., automated 
nucleic acid synthesis). 

Libraries can be screened with probes (such as antibodies to the PRO or oligonucleotides of at least 
about 20-80 bases) designed to identify the gene of interest or the protein encoded by it. Screening the cDNA 

20 or genomic library with the selected probe may be conducted using sundard procedures, such as described in 
Sambrook ei al.. Molecular Cloning: A Labo ratory Manual (New York: Cold Spring Harbor Uboratory Press, 
1989). An alternative means to isolate the gene encoding PRO is to use PCR methodology fSambrook et al., 
sufira; Dicffenbach et al., PCR Primer: A Laboratory Manual (Cnld Spring Harbor Uboratory Press, 1995)]. 
The Examples below describe techniques for screening a cDNA library. The oligonucleotide sequences 

15 selected as probes should be of sufficient length and sufficiently unambiguous that false positives are minimized. 
The oligonucleotide is preferably labeled such that it can be detected upon hybridization to DNA in the library 
being screened. Methods of labeling are weU known in the an, and include the use of radiolabels like ^-P-labeled 
ATP, bioiinylation or enzyme labeling. Hybridization conditions, including moderate stringency and high 
stringency, are provided in Sambrook et al., supra. 

\0 Sequences identified in such library screening methods can be compared and aligned to other known 

sequences deposited and available in public databases such as GenBank or other private sequence databases. 
Sequence identity (at either the amino acid or nucleotide level) within defined regions of the molecule or across 
the full-length sequence can be determined using methods known in the an and as described herein. 

Nucleic acid having protein coding sequence may be obtained by screening selected cDNA or genomic 

►5 libraries using the deduced amino acid sequence disclosed herein for the first time, and, if necessary, using 
convemional primer extension prqpedures as described in Sambrook et al., supra , to detect precursors and 
processing intermediates of mRNA that may not have been reverse-transcribed into cDNA. 
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2- Selection and Transformation of Hnsi Trik 
Host cells are transfected or transformed with expression or cloning vectors described herein for PRO 
production and cultured in conventional nutrient media modified as appropriate for inducing promoters, selecting 
transfonnants. or amplifying the genes encoding the desired sequences. The culture conditions, such as media, 
temperanire, pH and the like, can be selected by the skilled anisan without undue experimentation. In general, 
5 principles, protocols, and practical techniques for maximizing the productivity of cell cultures can be found in 
Mammalian Cell BiotechnoloBy: a Practical Approayft M. Butler, ed. (IRL Press. 1991) and Sambrook et al.. 
supra . 

Methods of eukaiyotic cell transfection and prokaiyotic cell transformation are known to the ordinarily 
skilled artisan, for example. CaCl,. CaPO.. liposome-mediated and electroporation. Depending on the host cell 
10 used, transformation is performed using standard techniques appropriate to such cells. The calcium treatment 
employing calcium chloride, as described in Sambrook et al.. supra, or electroporation is generally used for 
prokaiyotes. Infection with Agrobaaerium lumefaciens is used for transformation of certain plant cells, as 
described by Shaw etal.. Gene. 23:315 (1983) and WO 89/05859 published 29 June 1989. For mammalian cells 
without such cell walls, the calcium phosphate precipitation method of Graham and van der Eb. Virology . 
15 52:456^57 (1978) can be employed. General aspects of mammalian cell host system transfections have been 
described in U.S. Paiem No. 4.399.216. Transformations into yeast arc typically carried out according to the 
method ofVanSolingen etal.. OacL. 130:946 (1977) and Hsiao ctal., Prcc. Natl. Acad .Srj , fi i5!A) i^ia^o 
(1979). However, other methods for introducing DNA into cells, such as by nuclear microinjection, 
electroporation. bacterial protoplast fusion with intact cells, or polycaiions, e.g., polybrene, polyomithine. may 
also be used. For various techniques for transforming mammalian cells, see Keown et al.. Methods in 
Enzymolofiv. 185:527-537 (1990) and Mansour et al.. Nature. 336:348-352 (1988). 

Suitable host cells for cloning or expressing the DNA in the vectors herein include prokaryote. yeast, 
or higher eukaryote cells. Suitable prokaiyotes include but are not limited to eubacteria. such as Gram-negative 
or Gram-positive organisms, for example. Enterobacteriaceae such as E. coli. Various E. coU strains are 
25 publicly available, such as £. coU K12 strain MM294 (ATCC 31.446); E. coliXMU (ATCC 31.537); £. coli 
strain W3110 (ATCC 27.325) and K5 772 (ATCC 53.635). Other suitable prokaiyotic host «j1Is include 
Enterobacteriaceae such as Escherichia, e.g.. E. coU, Enierobacier. Ermnia, Klebsiella, Proteus, Salmonella, 
e.g.. Salmonella typhimurium. Serratia. e.g.. Serrmia marcescans, and Shigella, as well as BaciUi such as B. 
subiilis and B. licheniformis (e.g.. B. lichenifomis 41P disclosed in DD 266.710 published 12 April 1989). 
30 Pseudomonas such as />. aeruginosa, and Strepiomyces . These examples are illustrative rather than limiting. 
Strain W31 10 is one panicularly preferred host orparem host because i. is a common host strain for recombinant 
DNA product fermentations. Preferably, the host cell secretes minimal amounts of proteolytic enzymes. For 
example, strain W31 10 may be modified to effect a genetic mutation in the genes encoding proteins endogenous 
to the host, with examples of such hosts including £. coli W31 10 strain I A2. which has the complete genotype 
35 lonA ; E. coli W3I10 strain 9E4. which has the complete genotype lonA ptr3; E. coli W31 10 strain 27C7 
(ATCC 55.244). which has the complete genotype lonApirS phoAElS iargF-lac)J69degPompTkarf ; E. coli 
W31 10 strain 37D6. which has the complete genotype /on^p/rJ|,/,o^£/5 (argF-lac)169 degP ompT rbs7 
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ilvG kaif \ E, coU W3U0 strain 40B4. which is strain 37D6 with a non-kanamycin resistant degP deletion 
mutation; and an £. coti strain having mutant periplasmic protease disclosed in U.S. Patent No. 4,946,783 issued 

7Augustl990. Alternatively, w vi/to methods of cloning, e.g.. PCR or other nucleic acid polymerase reactions 
are suiuble. 

In addition to prokaryotes. eukaryotic microbes such as fdamentous fungi or yeast are suitable cloning 
5 or expression hosts for PRO-encoding vectors. Saccharomyces cerevisiae is a commonly used lower eukaryotic 
host microorganism. Others include Schizosaccharomyces pombe (Beach and Nurse, Nature . 290: 140 [1981 J; 
EP 139,383 published 2 May 1985); Kluyveromyces hosts (U.S. Patent No. 4,943,529; Fleer et al., 
BioyTechnoloRv, 9:968-975 (1991)) such as, e.g., K, lactis (MW98.8C. CBS683. CBS4574; Louvencoun ei al., 
J. BacterioL , 737 [1983]). K, fragiUs (ATCC 12.424). K. bulgaricus (ATCC 16,045), AT. wickeramii (ATCC 
10 24.178), K. walrii (ATCC 56,500), K. drosophilarum (ATCC 36,906; Van den Berg et al.. Bio/Technology, 
8:135 (1990)). K, thermoioierans, and K. marxiams: yarrowia (EP 402,226); Pichia pastoris (EP 183,070; 
Sreekrishna et al..-J. Basic Microbiol.. 28:265-278 [1988]); Candida; Trichoderma reesia (EP 244.234); 
Neurospora crassa (Case et al., Proc. Natl. Acad. Sci TI.SA 76:5259-5263 [1979]); Schwanniomyces such as 
Schwanniomyces occidemaiis (EP 394,538 published 31 October 1990); and filamentous ftmgi such as, e.g., 
15 Neurospora. PeniciUium, ^y/wjc/offiim (WO 91/00357 published 10 Januao^ AspergiUus hosts such 

as A. niduUms (Ballance et al.. Biochem. Biophvs. Res. Commun ] 12:284-289 [1983]; Tilbum et al.. Gene . 
26:205-221 [1983]; Yeltonetal., Proc. Natl. Acad. Sci. USA 81: 1470-1474 [1984]) and A, m^^r (Kelly and 
Hynes, EMBOJ., 4:475-479 [ 1985]). Methyloiropic yeasts are suitable herein and include, but are not limited 
to, yeast capable of growth on methanol selected from die genera consisting of Hansemda, Candida. Kloeckera. 
20 Pichia, Saccharomyces. Torulopsis. and Rhodotorula. A list of specific species that are exen^Iary of this class 
of yeasts may be found in C. Anthony, The Biochemistry of Methvlotrophs. 269 (1982). 

Suiuble host cells for the expression of glycosylated PRO are derived from multicellular organisms. 
Examples of invertebrate cells include insect cells such as Drosophila S2 and Spodoptera Sf9, as well as plant 
cells. Examples of useful mammalian host cell lines include Chinese hamster ovary (CHO) and COS cells. 
25 More specific examples include monkey kidney CVl line transformed by SV40 (COS-7, ATCC CRL 1651); 
human embryonic kidney line (293 or 293 cells subcloned for growth in suspension culmre, Graham ei al., 

> 36:59 (1977)); Chinese hamster ovary cellsADHFR (CHO, Urlaub and Chasin, Proc. Natl. Acad. 
ScLUSA. 77:4216 (1980)); mouse Sertoli cells (TM4, Mather. Biol. Reorod.. 23:243-251 (1980)): human lung 
cells (W138. ATCC CCL 75); human liver ceUs (Hep G2, HB 8065); and mouse mammary tumor (MMT 
30 060562, ATCC CCL51). The selection of die appropriate host cell is deemed to be within the skill in the an. 



3. Selection and Use of a RenHcahl^ Vi^rmr 
The nucleic acid (e.g. , cDN A or genomic DNA) encoding PRO may be inserted into a replicable vector 
for cloning (amplification of the DNA) or for expression. Various vectors are publicly available. The vector 
35 may. for example, be in the form of a plasmid, cosmid, viral panicle, or phage. The appropriate nucleic acid 
sequence may be inserted into the vector by a variety of procedures. In general, DNA is inserted into an 
appropriate restriction endonuclease site(s) using techniques known in Uic an. Vector components generally 

353 



BNSOOCID: <WO 9963088A2.L> 



wo 99/63088 

wyy/www PCT/US99/12252 

include, but are not limited to. one or more of a signal sequence, an origin of replication, one or more marker 
genes, an enhancer element, a promoter, and a transcription termination sequence. Construction of suitable 
vectors containing one or more of these components employs standard ligation techniques which are known to 
the skilled artisan. 

The PRO may be produced recombinantly not only directly, but also as a fusion polypeptide with a 
5 heterologous polypeptide, which may be a signal sequence or other polypeptide having a specific cleavage site 
ai the N-terminus of the manire protein or polypeptide. In general, the signal sequence may be a component of 
the vector, or it may be a part of the PRO-encoding DNA that is insened into die vector. The signal sequence 
may be a prokaryotic signal sequence selected, for example, from the group of the alkaline phosphatase, 
penicillinase. 1pp. or heat-stable enterotoxin 11 leaders. For yeast secretion the signal sequence may be. e.g.. 

10 the yeast invertase leader, alpha factor leader (including Saccharomyces and Kluyvewmyces a-factor leaders, 
the latter described in U.S. Patent No. 5.010.182). or acid phosphause leader, the C. albicans glucoamylase 
leader (EP 362.179 published 4 April 1990). or the signal described in WO 90/13646 published 15 November 
1990. In mammalian cell expression, mammalian sigmd sequences may be used to direct secreuon of the 
protein, such as signal sequences from secreted polypeptides of the same or related species, as well as viral 

15 secretory leaders. 

Both expression and cloning vectors contain a nucleic acid sequence that enables the vector to replicate 
in one or more selected host cells. Such sequences are well known for a variety of bacteria, yeast, and viruses. 
The origin of replication from die plasmid pBR322 is suitable for most Gram-negative bacteria, the 2^ plasmid 
origin is suitable for yeast, and various viral origins (SV40. polyoma, adenovirus. VSV or BPV) are useful for 
20 cloning vectors in mammalian cells. 

Expression and cloning vectors will typically contain a selection gene, also termed a selectable marker. 
Typical selection genes encode proteins diat (a) confer resistance to antibiotics or other toxins, e.g.. ampicillin. 

neomycin, methotrexate.or tetracycline, (b)complement auxotrophic der.ciencies.or(c)supply critical nutrients 
not available from complex media, e.g.. the gene encoding D-alaninc racemase for Bacilli. 

25 An example of suitable selectable markers for mammalian ceUs are those that enable the idemification 

of cells competent to take up the PRO-encoding nucleic acid, such as DHFR or thymidine kinase. An 
appropriate host cell when wild-type DHFR is employed is the CHO cell line deficiera in DHFR activity, 
prepared and propagated as described by Urlaubetal.. Proc. Natl. Acari iiqa 7-7.^->.a^.o.o^ A suitable 
selection gene for use in yeast is the irp\ gene present in the yeast plasmid YRp7 {Stinchcomb et al.. Namre . 

30 282:39 (1979): Kingsman et al.. Gene. 7:141 (1979); Tschemper et al.. Gene. 10:157 (1980)]. The trp\ gene 
provides a selection marker for a mutant strain of yeast lacking the ability to grow in tryptophan, for example. 
ATCC No. 44076 or PEP4-1 (Jones. Genetics. 85:12 (1977)J. 

Expression and cloning vectors usually contain a promoter operably linked to the PRO-encoding nucleic 
acid sequence to direct mRNA synthesis. Promoters recognized by a variety of potemial host cells are well 

35 known. Promoters suitable for use with prokaryotic hosts include the p-lactamase and lactose promoter systems 
[Chang et al.. Nature. 275:615 (1978); Goeddel et al.. Nasure. 281:544 (1979)]. alkaline phosphatase, a 
tryptophan (trp) promoter system [Goeddel. Nucleic Acid» Rpc 8:4057 (1980); EP 36.776), and hybrid 
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promoters such as the tac promoter [deBoer et al., Proc. Natl. Acad. Sci. USA 80:21-25 (1983)J. Promoiers 
for use in bacterial ^sterns also will contain a Shine-Dalgamo (S.D.) sequence operably linked to the DNA 
encoding PRO. 

Examples of suitable promoting sequences for use with yeast hosts include the promoters for 3- 
phosphoglycerate kinase [Hiizeman et al.. J. Biol. Chem:. 255:2073 (1980)) or other glycolytic enzymes [Hess 
5 «t al., J. Adv. Enzyme Rep., 7:149 (1968); Holland, Biochemistry . 17:4900 (1978)]. such as enolase. 
glyceraldehyde-S-phosphate dehydrogenase, hexokinase.pynivatedecarboxylase.phosphofructokinase.glucose- 
6.phosphate isomerase, 3-phosphoglycerate mutase. pyruvate kinase, iriosephosphateisomerase.phosphoglucose 
isomerase, and glucokinase. 

Other yeast promoters, which are inducible promoters having the additional advantage of transcription 
10 controlled by growth conditions, are the promoter regions for alcohol dehydrogenase 2, isocyiochrome C. acid 
phosphatase, degradative enzymes associated with nitrogen metabolism, metallothionein. glyceraldehyde-3- 
phosphate dehydrogenase, and enzymes responsible for maltose and galactose utilization. Suitable vectors and 
promoters for use in yeast expression are ftinher described in EP 73.657. 

PRO transcription from vectors in mammalian host cells is controlled, for example, by promoters 
15 obtained from the genomes of viruses such as polyoma virus, fowlpox virus (UK 2,21 1,504 published 5 July 
1989). adenovirus (such as Adenovirus 2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a 
retrovirus. hepatitis-B virus and Simian Vims 40 (SV40), from heterologous mammalian promoiers, e.g., the 
actin promoter or an immunoglobulin promoter, and from heat-shock promoters, provided such promoters are 
compatible with the host cell systems. 
20 Transcription of a DNA encoding the PRO by higher eukaryoies may be increased by inserting an 

enhancer sequence into the vector. Enhancers are cis-aciing elements of DNA. usually about from 10 to 300 
bp, that act on a promoter to increase its transcription. Many enhancer sequences are now known from 
mammalian genes (globin, elastase, albumin, a-feioprotein. and insulin). Typically, however, one will use an 
enhancer from a eukaryotic cell virus. Examples include the SV40 enhancer on the late side of the replication 
25 origin (bp 100-270). the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side of the 
replication origin, and adenovirus enhancers. The enhancer may be spliced into the vector at a position 5' or 
3* to the PRO coding sequence, but is preferably located at a site 5* from the promoter. 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, animal, human, or nucleated 
cells from other multicellular organisms) will also contain sequences necessary for the termination of 
30 transcription and for stabUizing the mRNA. Such sequences arc commonly available from the 5* and. 
occasionally 3', untranslated regions of eukaryotic or viral DNAs or cDNAs. These regions contain nucleotide 
segments transcribed as polyadenylated fragments in the untranslated portion of the mRNA encoding PRO. 

sail other methods, vectors, and host cells suitable for adaptation to the ^thesis of PRO in 
rax)mbinani venebrate cell culture are described in Gething et al.. Nature . 293:620-625 (1981); Maniei et al., 
35 Nanire. 281:40^6 (1979); EP 117,060; and EP 117,058. 
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4. Detecting Gene Amplificati on/Expression 
Gene amplification and/or expression may be measured in a sample directly, for example, by 
conventional Southern blotting. Northern blotting to quaniitate the transcription of mRN A [Thomas, Proc. Natl. 
Acad. Sci. USA. 77:5201-5205 (1980)]. dot blotting (DNA analysis), or in situ hybridization, using an 
appropriately labeled probe, based on the sequences provided herem. Alternatively, antibodies may be employed 
5 that can recognize specific duplexes, including DNA duplexes. RNA duplexes, and DNA-RNA hybrid duplexes 
or DNA-proiein duplexes. The antibodies in turn may be labeled and die assay may be carried out where the 
duplex is bound to a surface, so that upon the formation of duplex on the surface, the presence of antibody bound 
to the diq>lex can be detected. 

Gene expression, alternatively, may be measured by inununological mediods. such as 
10 immunohistochemical staining of cells or tissue sections and assay of cell culture or body fluids, to quantitate 
directly the expression of gene product. Antibodies useful for immunohistochemical staining and/or assay of 
sample fluids may be either monoclonal or polyclonal, and may be prepared in any mammal. Conveniently, the 
antibodies may be prepared against a native sequence PRO polypeptide or against a synthetic peptide based on 
the DNA sequences provided herein or against exogenous sequence ftised to PRO DNA and encoding a specific 
15 antibody epitope. 

5. Purification of Polypeptide 
Forms of PRO may be recovered from culture medium or from host cell lysates. If membrane-bound, 

it can be released from the membrane using a suitable detergent solution (e.g. Triion-X 100) or by enzymatic 
20 cleavage. Cells employed in expression of PRO can be disrupted by various physical or chemical means, such 

as freeze-thaw cycling, sonication. mechanical disruption, or cell lysing agents. 

It may be desired to purify PRO fi-om recombinant cell proteins or polypeptides. The following 

procedures are exemplary of suitable purification procedures: by fractionation on an ion-exchange column; 

edianol precipitation; reverse phase HPLC; chromatography on silica or on a cation-exchange resin such as 
25 DEAE; chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration using, for example. 

Sephadex G-75; protein A Sepharose colunms to remove contaminants such as IgG; and metal chelating columns 

to bind epitope-tagged forms of the PRO. Various methods of protein purification may be employed and such 

methods are known in the an and described for example in Deutscher. Methods in Enzvmologv. 182 (1990); 

Scopes. Protein Purification: Princit)les and Practice Springer- Veriag, New York (1982). The purification 
30 step(s) selected will depend, for example, on the nature of the production process used and the particular PRO 

produced. 

E. Uses for PRO 

Nucleotide sequences (or their complement) encoding PRO have various applications in the an of 
35 molecular biology, including uses as hybridization probes, in chromosome and gene mapping and in the 
generation of anti-sense RNA and DNA. PRO nucleic acid will also be useful for the preparation of PRO 
polypeptides by the recombinant techniques described herein. 
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The ftilMength native sequence PRO gene, or portions thereof, may be used as hybridization probes 
for a cDNA library to isolate the full-length PRO cDNA or to isolate still other cDNAs (for instance, those 
encoding naturally-occurring variants of PRO or PRO from other species) which have a desired sequence identity 
to the native PRO sequence disclosed herein. Optionally, the length of the probes will be about 20 to about 50 
bases. The hybridization probes may be derived from at least panially novel regions of the full length native 
5 nucleotide sequence wherein those regions may be determined without undue experimentation or from genomic 
sequences including promoters, enhancer elements and intions of native sequence PRO. By way of example, 
a screening method will comprise isolating die coding region of the PRO gene using the known DNA sequence 
to synthesize a selected probe of about 40 bases. Hybridization probes may be labeled by a variety of labels, 
including radionucleotides such as ^P or ^S, or enzymatic labels such as alkaline phosphatase coupled to the 
10 probe via avidin/bioiin coupling systems. Labeled probes having a sequence complementary to that of die PRO 
gene of the present invention can be used to screen libraries of human cDNA, genomic DNA or mRNA to 
determine which members of such libraries die probe hybridizes to. Hybridization techniques are described in 
further detail in the Examples below. 

Any EST sequences disclosed m the present application may similarly be employed as probes, using 

IS the methods disclosed herein. 

Odier useful fragments of the PRO nucleic acids include aniisense or sense oligonucleotides comprising 
a singe-stranded nucleic acid sequence (eidier RNA or DNA) capable of binding to target PRO mRNA (sense) 
or PRO DNA (antisense) sequences. Aniisense or sense oligonucleotides, according to the present invention, 
comprise a fragment of the coding region of PRO DNA. Such a fragment generally comprises at least about 14 

20 nucleotides, preferably from about 14 to 30 nucleotides. The ability to derive an antisense or a sense 
oligonucleotide, based upon a cDNA sequence encoding a given protein is described in, for example. Stein and 
Cohen (Cancer Res. 48:2659. 1988) and van der Krol et al. ( BioTechniques 6 :958. 1988). 

Binding of antisense or sense oligonucleotides to target nucleic acid sequences results in the formation 
of duplexes that block transcription or translation of the target sequence by one of several means, including 

25 enhanced degradation of die diq)lexcs, premanire termination of transcription or translation, or by oUier means. 
The amisense oligonucleotides dius may be used to block expression of PRO proteins. Antisense or sense 
oligonucleotides ftmher comprise oligonucleotides having modified sugar-phosphodiester backbones (or other 
sugar linkages, such as diose described in WO 91/06629) and wherein such sugar linkages are resistant to 
endogenous nucleases. Such oligonucleotides widi resistant sugar linkages are stable in vivo (i.e.. capable of 

30 resisting enzymatic degradation) but retain sequence specificity to be able to bind to target nucleotide sequences. 

Odier examples of sense or antisense oligonucleotides include diose oligonucleotides which are 
covalently linked to organic moieties, sudi as diose described in WO 90/10048, and odier moieties diat increases 
affinity of die oligonucleotide for a target nucleic acid sequence, such as poly-(L-lysine). Punher still, 
intercalating agents, such as ellipticine, and alkylating agents or metal complexes may be attached to sense or 

35 antisense oligonucleotides to modify binding specificities of die antisense or sense oligonucleotide for die target 
nucleotide sequence. 
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Antisense or sense oligonucleotides may be introduced into a cell containing the target nucleic acid 
sequence by any gene transfer method, including, for example. CaPO.-mediated DNA transfcction. 
electroporation. or by using gene transfer vectors such as Epstein-Barr virus. In a preferred procedure, an 
antisense or sense oligonucleotide is inserted into a suitable retroviral vector. A cell containing die target nucleic 
acid sequence is contacted with the recombinant retroviral vector, either in vivo or ex vivo. Suitable retroviral 
5 vectors include, but are not limited to. diose derived from the murine retrovirus M-MuLV, N2 (a retrovirus 
derived from M-MuLV). or the double copy vectors designated DCT5A, DCT5B and DCT5C (see WO 
90/13641). 

Sense or antisense oligonucleotides also may be introduced into a cell containing the target nucleotide 
sequence by formation of a conjugate with a ligand binding molecule, as described in WO 91/04753. Suitable 
10 ligand binding molecules inchide. but are not limited to, cell surface receptors, growth factors, other cytokines, 
or other ligands that bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does 
i»t substantially interfere with the ability of the ligand binding molecule to bind to its corresponding molecule 
or receptor, or block entry of die sense or antisense oligonucleotide or its conjugated version into the ceU. 

Alternatively, a sense or an antisense oligonucleotide may be introduced into a cell containing the target 
15 nucleic ffl:id sequence by formation of an oligonucleotide-lipid complex, as described in WO 90/10448. The 
sense or antisense oligonucleotide-lipid complex is preferably dissociated within the cell by an endogenous lipase. 

The probes may also be employed in PCR techniques to generate a pool of sequences for identification 
of closely related PRO coding sequences. 

Nucleotide sequences encoding a PRO can also be used to construct hybridization probes for nian>ing 
20 the gene which encodes that PRO and for the genetic analysis of individuals with genetic disorders. The 
nucleotide sequences provided herein may be mapped to a chromosome and specific regions of a chromosome 
using known techniques, such as in situ hybridization, linkage analysis against known chromosomal markers, 
and hybridization screening with libraries. 

When die coding sequences for PRO encode a protein which binds to another protein (example, where 
25 the PRO is a receptor), the PRO can be used in assays to idemify the odier proteins or molecules involved in the 
binding interaction. By such methods, inhibitors of die receptor/ligand binding interaction can be identified. 
Proteins involved in such binding interactions can also be used to screen for peptide or small molecule inhibitors 
or agonists of the binding interaction. Also, the receptor PRO can be used to isolate correlative ligand(s). 
Screening assays can be designed to fmd lead compounds that mimic the biological activity of a native PRO or 
30 a receptor for PRO. Such screening assays will include assays amenable to high-throughput screening of 
chemical libraries, making diem particularly suiuble for identifying small molecule drug candidates. Small 
molecules contemplated include syndietic organic or inorganic compounds. The assays can be performed in a 
variety of formats, including protein-protein binding assays, biochemical screening assays, immunoassays and 
ceU based assays, which are well characterized in die an. 
35 Nucleic acids which encode PRO or its modified forms can also be used to generate eidier transgenic 

animals or "knock out" animals which, in mm. are useftil in die developmem and screening of dierapeutically 
useful reagents. A transgenic animal (e.g.. a mouse or rat) is an animal having cells diat contain a transgene. 
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which transgene was introduced into the animal or an ancestor of the animal at a pienatal. e.g.. an embryonic 
suge. A transgene is a DNA which is integrated into the genome of a cell from which a transgenic animal 
develops. In one embodiment, cDNA encoding PRO can be used to clone genomic DNA encoding PRO in 
accordance with established techniques and the genomic sequences used to generate transgenic animals that 
contain cells which express DNA encoding PRO. Methods for generating transgenic animals, particularly 
5 animals such as mice or rats, have become conventional in the art and are described, for example, in U.S. Patem 
Nos. 4.736.866 and 4.870.009. Typically, particular cells would be targeted for PRO transgene incorporation 
with tissue-specific enhancers. Transgenic animals that include a copy of a transgene encoding PRO introduced 
into the germ line of the animal at an embryonic stage can be used to examine the effect of increased expression 
of DNA encoding PRO. Such animals can be used as tester animals for reagents thought to confer protection 
10 from, for example, pathological conditions associated with its overexpression. In accordance with this facet of 
the invention, an animal is treated witii die reagem and a reduced incidence of die pathological condition, 
compared to untreated animals bearing the transgene. would indicate a potential therapeutic intervention for the 
pathological condition. 

Alternatively, non-human homologues of PRO can be used to constiuct a PRO "knock out" animal 
15 which has a defective or altered gene encoding PRO as a result of homologous recombination between the 
endogenous gene encoding PRO and altered genomic DNA encoding PRO imroduced into an embryonic stem 
cell of the animal. For example. cDNA encoding PRO can be used to done genomic DNA encoding PRO in 
accordance with established techniques. A portion of the genomic DNA encoding PRO can be deleted or 
replaced with another gene, such as a gene encoding a selectable marker which can be used to monitor 
20 imegration. Typically, several kilobases of unaltered flanking DNA (both at the 5' and 3' ends) are included 
in the vector |see e.g.. Thomas and Capecchi. £eU. 51:503 (1987) for a description of homologous 
recombination vectors]. The vector is introduced into an embryonic stem cell line (e.g.. by electroporation) and 
cells in which the introduced DNA has homologously recombined with the endogenous DNA are selected [see 
e.g.. Li et al.. CeM. 69:915 (1992)]. The selected cells are then injected into a blastocyst of an animal (e.g.. 
> a mouse or rat) to form aggregation chimeras (see e.g.. Bradley, in Teratocarcinomas and Embryonic Stem 
Cells: A Praaical Approach, E. J. Robertson, ed. (IRL. Oxford. 1987). pp. 1 13-1521. A chimeric embryo can 
then be implanted into a suitable pseudopregnant female foster animal and die embryo brought to term to create 
a "knock out" animal. Progeny harboring die homologously lecombincd DNA in their germ cells can be 
identified by standard techniques and used to breed animals in which all cells of die animal comain die 
I homologously recombined DNA. Knockout animals can be characterized for instance, for dieir ability to defend 
against cettain pathological conditions and for dieir development of pathological conditions due to absence of 
die PRO polypeptide. 

Nucleic acid encoding die PRO polypeptides may also be used in gene tiierapy. In gene therapy 
applications, genes are imroduced into cells in order to achieve in vivo syndiesis of a therapeutically effective 
genetic product, for example for replacemem of a defeaive gene. "Gene tiierapy- includes botii conventional 
gene dierapy where a lasting effect is achieved by a single treatment, and die administration of gene dierapemic 
agents, which involves tiie one time or repeated adminiso-ation of a dierapeutically effective DNA or mRNA. 



359 



PCT/US99/122S2 

Antisense RN As and DNAs can be used as therapeutic agents for blocking the expression of cenain genes in 
vivo. It has already been shown that shon antisense oUgonucIeotides can be imponed into ceUs where they act 
as inhibitors, despite their low iniraceUular concentrations caused by their restricted uptake by the cell 
membrane. (Zamecnik a at. , Proc. Natl. Acad Sri irsA 83:4143-»146 (19861). The oligonucleotides can be 
modified to enhance their uptake, e.g. by substinuing their negatively charged phosphodiester groups by 
S uncharged groups. 

There are a variety of techniques avaUable for introducing nucleic acids into viable cells. The 
techniques vary depending upon whether the nucleic acid is transferred into cultured cells in vitro, or in vivo in 
the cells of the intended host. Techniques suitable for the transfer of nucleic acid into mammalian cells in vitro 
include die use of liposomes, electroporation. microinjection, cell fusion. DEAE-dextran. the calcium phosphate 

10 precipitation method, etc. The currendy preferred in vivo gene transfer techniques include transfcction widi viral 
(typically retroviral) vectors and viral coat protein-liposome mediated transfcction (Dzau et al., Trends in 
Biotechnology 1 1, 205-210 [1993]). In some simations it is desirable to provide the nucleic acid source with 
an agem diat targets die target cells, such as an antibody specific for a cell surface membrane protein or the 
target ceU. a ligand for a receptor on die target cell. etc. Where liposomes are employed, proteins which bind 

15 to a cell surface membrane protein associated widi endocytosis may be used for urgeting and/or to facUitate 
uptake, e.g. capsid proteins or fragments diereof tropic for a panicular cell type, antibodies for proteins which 
undergo imemalizationincycling. proteins diat target intracellular localization and enhance mtracellular half-life. 
The technique of receptor-mediated endocytosis is described, for example, by Wu et al.. J. Biol. Chem 262, 
4429-4432 (1987); and Wagner et al.. Proc. Nad. Acad .Sri ika 87. 34 10-3414 (1990). For review of gene 

20 marking and gene dierapy protocols see Anderson et al.. Science 236. 808-813 (1992). 

The PRO polypqitides described herein may also be employed as molecular weight markers for protein 
electrophoresis purposes. 

The nucleic acid molecules encoding die PRO polypeptides or fragments diereof described herem are 
useful for chromosome idemification. In dris regard, diere exists an ongoing need to idemiiy new chromosome 

25 markers, since relatively few chromosome marking reagents, based upon actual sequence dau are prescmly 
avaUable. Each PRO nucleic acid molecule of die present invemion can be used as a chromosome marker. 

The PRO polypeptides and nucleic acid molecules of die present invention may also be used for tissue 
typing, wherein die PRO polypeptides of die present invention may be differemially expressed in one tissue as 
compared to anodier. PRO nucleic acid molecules will find use for generating probes for PCR. Northern 

(0 analysis. Soudiem analysis and Western analysis. 

The PRO polypeptides described herein may also be employed as dierapeutic agents. The PRO 
polypeptides of die present invention can be formulated according to known mediods to prepare pharmaceutically 
useful compositions, whereby die PRO product hereof is combmed in admixnire widi a pharmaceutically 
acceptable carrier vehicle. Therapeutic formulations are prepared for storage by mixing die active ingredient 

5 having die desired degree of purity widi optional physiologically accepuble carriers, excipients or stabilizers 
(Remington's Phaymaceurical .«?cie^iccs 16di edition. Osol. A. Ed. (1980)). in die form of lyophilized 
formulations or aqueous solutions. Acceptable carriers, excipients or stabilizers are nontoxic to recipients at die 
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dosages and concentrations employed, and include buffers such as phosi*aie. citrate and other organic acids: 
antioxidants including ascorbic acid; low molecular weight (less than about 10 residues) polypeptides; proteins, 
such as serum albumin, gelatin or immunoglobulins: hydrqjhHic polymers such as polyvinylpyrrolidoiw. amino 
acids such as glycine, glutamine. asparagine. arginine or lysine; monosaccharides, disaccharides and otiicr 
carbohydrates including glucose, mannosc. or dextrins; chelating agents such as EDTA; sugar alcohols such as 
5 mannitol or sorbitol; salt-forming counterions such as sodium; and/or nonionic surfactants such as TWEEN™. 
PLURONICS™ or PEG. 

The formulations to be used for in vivo administration must be sterile. This is readily accomplished by 
fUtration through sterile filtration membranes, prior to or following lyophilization and reconstinition. 

Therapeutic compositions herein generally are placed into a container having a sterile access pon. for 
10 example, an intravenous solution bag or vial having a stopper pierceable by a hypodermic injection needle. 

The route of administration is in accord with known methods, e.g. injection or infusion by intravenous, 
intraperitoneal, intracerebral, intramuscular, intraocular, intraarterial or imralesional routes, topical 
adnunistration. or by sustained release systems. 

Dosages and desired drug concentrations of pharmaceutical compositions of the present invention may 
15 vary depending on the particular use envisioned. The determination of the apprq)riate dosage or route of 
administration is well witiiin tiie skill of an ordinary physician. Animal experiments provide reliable guidance 
for the determination of effective doses for human therapy. Interspecies scaling of effective doses can be 
performed following the principles laid down by Mordenti. J. and Chappell. W. 'The use of interspecies scaling 
in toxicokinetics" In Toxicokinetics and New Dnig Developmcm. Yacobi et al.. Eds.. Pergamon Press. New 
20 York 1989. pp. 42-96. 

When in vivo administration of a PRO polypeptide or agonist or antagonist thereof is employed, normal 
dosage amounts may vary from about 10 ng/kg to up to 100 mg/kg of mammal body weight or more per day, 
preferably about 1 Mg/kg/day to 10 mg/kg/day. depending upon the route of administration. Guidance as to 
panicular dosages and metitods of delivery is provided in the literature; see. for example. U.S. Pat. Nos. 

25 4.657.760; 5.206,344; or 5,225.212. It is anticipated that different formulations will be effective for different 
treatment compounds and differem disorders, that administration targeting one organ or tissue, for example, may 
necessitate delivery in a manner differem from dut to another organ or tissue. 

Where sustained-release administration of a PRO polypeptide is desired in a formulation with release 
characteristics suitable for the treatment of any disease or disorder requiring administration of the PRO 

30 polypeptide, microencapsulation of the PRO polypeptide is contemplated. Microencapsulation of recombinant 
proteins for sustained release has been successfully performed witit human growth hormone (rhGH). interferon- 
(rtilFN- ). interleukin-2. and MN rgpl20. Johnson et al., Nat. Med. . 2:795-799 (1996); Yasuda. Biomed. 
BSL. 27:1221-1223 (1993); Horaetal.. Bio/Technolopv R-7SS.7S»noonv ri.io«w "Design and Production 
of Single Immunization Vaccines Using Polylactide Polyglycolide Microsphere Systems." in Vaccine Design: 

35 The Subunit and Adjuvam Approach Powell and Newman, eds. (Plenum Press: New Yoric. 1995). pp. 439-462; 
WO 97/03692. WO 96/40072, WO 96/07399; and U.S. Pat. No. 5,654,010. 
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The sustained-release formulaiions of these proteins were developed using poly.|actic-coglycolic acid 
(PLGA) polymer due to m biocompatibUity and wide range of biodegradable properties. The degradation 
products of PLGA. lactic and glycolic acids, can be cleared quickly within the hunuai body. Moreover, the 
degradability of this polymer can be adjusted from months to years depending on its molecular weight and 

composition. Unvis. -Controlled release ofbioactive agents from lactide/glycolide polymer." in: M.Chasinand 

5 R. Unger (Eds.). Biodegradable Polymers « Dn.o nMi».^ . c^., ^.^^^^^ j^^^^. 

1-41. 

TOs invention encompasses methods of screening compounds to identify those that mimic the PRO 
polypeptide (agonisu) or prevent the effect of the PRO polypeptide (antagonists). Screening assays for 
antagonist drug candidates are designed to identify compounds that bind or complex with the PRO polypeptides 
10 encoded by the genes identified herein, or otherwise interfere wiU, the interaction of the encoded polypeptides 
with other ceUular proteins. Such screening assays will include assays amenable to high-throughput screening 
of chemical libraries, making diem particularly suitable for idemifying small molecule drug candidates. 

The assays can be performed in a variety of formats, including protein-protein binding assays, 
biochemical screening assays, immunoassays, and cell-based assays, which are well characterized in the art. 

AU a^ays for antagonists are cominon in that they call for contacting the drug candidate widi a PRO 
polypeptide encoded by a nucleic acid identified herein under conditions and for a time sufficient to allow these 
two components to interact. 

In binding assays, the interaction is binding and the complex formed can be isolated or detected in the 
reaction mixture. In a particular embodiment, the PRO polypeptide encoded by the gene identified herein or the 
drug candidate is immobilized on a solid phase, e.g.. on a microtiter plate, by covalent or non-covalem 
attachments. Non-covalent anachmem generally is accomplished by coating the solid surface with a solution of 
the PRO polypeptide and during. Alternatively, an immobilized antibody, e.g.. a monoclonal antibody, specific 
for the PRO polypeptide to be immobilized can be used to anchor i, to a solid surface. The assay is performed 
by adding the non-immobilized component, which may be labeled by a detectable label, to the immobilized 
component, e.g.. the coated surface containing the anchored component. When the reaction is complete, the 
non-reacted components are removed, e.g.. by washing, and complexes anchored on tiie solid surface are 
detected. When the originally non-immobilized component carries a detectable label, the detection of label 
umnobUized on the surface indicates that complexing occurred. Where the originally non-immobilized 
component does not carry a label, complexing can be detected, for example, by using a labeled amibody 
30 specifically binding the immobilized complex. 

^ If die candidate compound interacts witii but does not bind to a particular PROpolypeptide encoded by 
a gene identified herein, its imeraction with that polypeptide can be assayed by methods weU known for detecting 
protein-protein interactions. Such assays include traditional approaches, such as. e.g.. cross-linking, co- 
immunoprecipitation. and co-purification through gradients or chromatographic colutnns. In addition, protein- 
protein interactions can be monitored by using a yeast-based genetic system described by Fields and co-workers 
(Fields and Song. Nature (Undon) 340:245-246 (1989); Chien et al.. Proc. N..> Ar.rf 88:9578- 
9582 (1991)) as disclosed by Chevray.and Nathans. Proc. Natl. Acrt ik^ .,o. ..cn c,«, (p^,) 
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.nmscriptional activators. s»ch as yeas. GAU. consist of two physically discrete modular domains, one acting 
as the DNA-bmdii« domain, the other one fimctioiung as d,e transcripuon-activation domain. The yeas, 
expression system described in the foregoing publications (generally referred to as the 'two-hybrid system") 
takes advantage of dus property, and employs two hybrid proteins, one in which the targe, protein is fused to 
the DNA-bi«ling domain of GAU. and anod,er. in which candida.e ac.ivanng proteins are fused u, the 
5 activation domain. The expression ofaGALl-ZacZ reporter gene muler control ofaGAL*-acdva.ed promoter 
depends on recons.itu.ion of GAM activity via protein-protein interaction. Colonies containing interacting 
polypeptides are detected wi* a chromogenic subsoate for P-galactosidase. A complete kit 
(MATCHMAKER-) for identifying protein-protein interactions benveen two specific proteim using u,e two- 
hybnd technique is commercially available from Clontech. This system can also be extended to map protein 
10 domams involved in specific protein interactions as well as to pinpoim amino acid residues that are crucial for 
these interactions. 

Compounds that interfere wiU, d,e in.erac.ion of a gene encodmg a PRO polypep.ide identified herein 
and od,er inn-a- or ex.raceUular component can be .es.ed as follows: usually a reacion mixnare is prepared 
conu.mng Ae pnxluct of the gene and the intra- or extracellular componem under conditions and for a time 
15 allowmg for d.e mterac.ion and binding of dte nvo products. To test d,e ability of a candidate compound .o 
inh.b.. bmdmg. *e reaction is run in d,e absence and in Ae presence of Ae .es. compound. In addi.ion a 
placebo may be added to a third reaction mixture, to serve as positive comrol . The binding (complex formation) 
between Ae test compound and Ae intra- or extracellular componem presem in the mixmre is monitored as 
descnbed hereinabove. The formation of a complex in Ae comrol reac.ion(s) bm no. in Ae reaction mixmre 

20 ~««'n»g*'^«es.compoundindicatesAatAe,es.compoundin,erfereswiA.hein.erac.ionofAe^^^ 

and its reaction partner. 

To assay for anugonisu. Ae PRO polypeptide may be added .o a cell along wiA Ae compound to be 

screened foraparticular activity and Ae ability of Ae compound to inhibit Ac activity of interest in Aepresemte 
of Ae PRO polypeptide indicates Aa. Ae compound is an an.agonis. .o Ae PRO polypeptide. Alternatively 
. anugonists may be detected by combining Ae PRO polypeptide and a potemial antagonis. wiA membrane-bound 
PRO poIypep.ide recep.ors or recombinan. receptors under appropriate conditions for a competitive inhibition 
assay. The PRO polypeptide can be labeled, such as by radioactivity, such Aat Ae number of PRO polypeptide 
molecules bound to Ae receptor can be used to determine Ae effectiveness of Ae potemial antagonist. The gene 
encoding Ae receptor can be identified by numerous meAods known to those of skill in Ae an. for example 

Lgand pamung and FACS sorting. Coligan et al.. Current PrntornU in . ,(,): chapter 5 (1991). 

Preferably, expression cloning is employed wherem polyadenylated RNA is prepared from a cell responsive to 
the PRO polypeptide and a cDNA library created from Ais RNA is divided m.o pools ami used .o .ransfect COS 
cells or oAer cells Aa. are no. responsive to Ae PRO polypep.ide. Transfected cells Aa. are grown on glass 
slides are exposed to labeled PRO polypep.ide. The PRO polypeptide can be labeled by a variety of means 
mcludmg iodination or inclusion of a recognition site for a site-specific protein kinase. Following fixation and 
UKubauon. Ae slMes are subjected to autoradiographic amilysU. Positive pools are identified and sub-pools are 
prepared and re-.ransfected using an imeractive sub-pooling and re-screening process, evenmally yielding a 
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Single clone that encales the pinaiive receptor. 

As an alternative appn«ch for receptor identification, labeled PRO polypeptide can be photoaffinity- 
intked wth cell tnetnbrane or extract preparadons that express d« receptor moleode. C^ss-Iinked material is 
resolved by PAGE and exposed to X-ray film. The labeled coniplex containing Ae receptor can be excised 
«»lvedmto peptide fragments, and subjected to protein micro-sequencing. The amino acid sequence obtained 
5 ftom micro, sequencing would be used to design a set of degenerate oUgonucleotide p^bes to screen a cDNA 
library to identify the gene encoding the putative receptor. 

In anoAer assay for antagonists, mammalian cells or a membrane preparation expressing the receptor 
would be incubated with labeled PRO polypeptide in the presence of the candidate compound. THe ability of 
the compound to enhance or block this interaction could then be measured. 
10 More specific examples of potential anugonists include an oligonucleotide .ha, binds to the fusions of 

unmunoglobulin with PRO polypeptide, and. in particular, antibodies including, without limitation, poly- and 
monoclomd antibodies and amibody fragments, single^n antibodies, anti-idiotypic antibodies, and chimeric 
or humanized versions of sudt antibodies or fragm«us. as well as human antibodies and antibody fragments 
Alternatively, a potemiai antagonist may be a closely related protein, for example, a muuted form of the PRO 
15 P°'yP«=P'i''e«hatrecognizesthereceptorbuti.^arts«,effect..here^ 
PRO polypeptide. 

Another potential PRO polypeptide antagonist is an antisense RNA or DNA construct prepared using 
anttsense technology, where, e.g. . an antisense RNA or DNA molecule acts to block dinxUy the translation of 
mRNA by hybridizing to targeted mRNA and preventing protein translation. Antisense technology can be u^ 
20 to control gene expression through triple-helix formation or amisense DNA or RNA. both of which methods are 
based on binding of a polynucleotide to DNA or RNA. For example, .he 5' coding ponionof Uu. polynucleotide 
sequence, which encodes the mature PRO polypeptides herein, is used to design an antisense RNA 
oligonucleotide of from about 10 to 40 base pairs in leng*. A DNA oligonucleotide is desiened to be 
~"V'emen.arytoaregionof.hegeneinvolvedintranscription(triplehelix-seel^« NudAcidsRe^ 
25 6:3073 (1979); Cooney et al.. 241: 456 (1988); Dervan et al.. Sdence, 251:1360 (1991)) thereby 

preventing transcription and the production of the PRO polypeptide. THe antisense RNA oligonucleotide 
hybndizes to the mRNA in W«, and blocks translation of Ute mRNA molecule into the PRO polypeptide 
(antisense - Okano. NeutacbenL. 56:560 (1991); 01i.odeoxv...H^.^., ...... 

ExEression (CRC Press: Boca Raton. FL. 1988). The oligonucleotides described above can also be delivered 
30 «, cells such dutt the antisense RNA or DNA may be expressed in Wv. to inhibit production of Uu. PRO 
polypepude.WhenantisenseDNAisused.oligodeoxyribonucleo.idesderivedfromthe^^^ 
eg., between about -10 and H- 10 positions of the target gene nucleotide sequence, are preferred. 

Potemiai antagonists include small molecules that bind to the active site. ti« receptor binding site or 
growth factor or other relevant binding site of the PRO polypeptide, .hereby blocking the normal biological 
35 activity of the PRO polypeptide. Examples of small molecules include, but are not limited to. small peptides 
or peptide-like molecules, preferably soluble peptides, and symhetic non-pepUdyl orgamc or inorganic 
compounds. 
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Ribozymes are enzymatic RNA molecules capable of caialyzing the specific cleavage of RNA. 
Ribozymes act by sequeiure-specifichybridizationto the complementaiyuirgetRNA. follow^ 
cleavage. Specific ribozyme cleavage sites within a potential RNA target can be identified by kn^ 
For ftirther detaUs see. e.g. . Rossi. Cmt Biology. 4:469^71 (1994). and PCT publication No. WO 97/33551 
(published September 18. 1997). * 

Nucleic acid molecules in triple-helix formation used to inhibit transcription should be single-stranded 
and composed of deoxynucleoudes. The base composition of these oligonucleotides is designed such that i, 
promotes triple-helix formation via Hoogsteen base-pairing rules, which generally require sizeable stretches of 
purines or pyrimidines on one strand of a duplex. For further details see. e.g.. PCT publication No. WO 
97/33551. supra. 

niese small molecules can be identified by any one or more of the screening assays discussed 
hereinabove and/or by any other screening techniques well known for diose skilled in the art. 

PR0189 can be used in assays with W01A6.1 of C. Elegans. phosphodiesterases, transponers and 
proteins which bind to fatty acids, to determine the relative activities of PR0189 against these proteins. The 
results can be aj^lied accordingly. 

P- Anti-PRO Amihnritf t 

The presem invention farther provides anU-PRO antibodies. Exemplary antibodies include polyclonal, 
monoclonal, humanized, bispecific. and heteroconjugate antibodies. 

1 Polyclonal Antih,i.^f^„ 

The anti-PRO antibodies may comprise polyclonal antibodies. Methods of preparing polyclonal 
antibodies are known ,o the skilled anisan. Polyclonal antibodies can be raised in a mammal, for example, by 
one or more injections of an immunizing agent and. if desired, an adjuvant. Typically, the immunizing agent 
and/or adjuvant wUl be injected in die mammal by multiple subcutaneous or intraperitoneal injections. The 
immunizing agent may include the PRO polypeptide or a fusion protein thereof It may be useful to conjugate 
the immunizing agent to a protein known to be immunogenic in die mammal being immmiized. Examples of 
such immum^enic proteins include but are no, limited to keyhole limpet hemocyanin. serum albumin, bovine 
diyroglobulin. and soybean trypsin inhibuor. Examples of adjuvants which may be employed include Freund s 
complete adjuvant and MPL-TDM adjuvam (monophosphoryl Lipid A. synthetic trehalose dicorynomycolate). 
The immunization protocol may be selected by one skilled in the an without undue experimentation. 
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2- Monoclonal Anfi^^jyy 
The ami-PRO antibodies may. alternatively, be monoclonal antibodies. Monoclonal amibodies may be 
prepared using hybridoma methods, such as those described by Kohler and MiUtein. Nature. 256:495 (1975). 
35 In a hybridoma method, a mouse, hamster, or other appropriate host animal, is typically immunized with an 
immunizing agent to elicit lymphocytes Uu« produce or are capable of producing antibodies that wUI specifically 
bind to the immunizing agem. Altemauvely. the lymphocytes may be immunized in vitro. 
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The immunizing agent will typically include the PRO polypeptide or a fusion protein thereof. 
Generally, either peripheral blood lymphocytes CPBLs") are used if ceUs of human origin are desired, or spleen 
cells or lymph node ceUs are used if non-human mammalian sources are desired. The lymphocytes are then 
fused wid, an immortalized cell line using a suitable fusing agem. such as polyethylene glycol, to form a 
hybridomacell [CSoding. fejfonoclonal Antibodies: PrinriniP. ..h P»..i.. Academic Press. (1986) pp. 59-103]. 
5 Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells of rodem. bovine 
and human origin. UsuaUy. rat or mouse myeloma cell lines are employed. The hybridoma cells may be 
cultured in a suitable culmre medium that preferably contains one or more substances that inhibit the growth or 
survival of die unfused. immortOized cells. For example, if the parental cells lack the enzyme hypoxan&ine 
guanine phosphoribosyl transferase (HGPRT or HPRT). the culmre medium for the hybridomas typically will 
10 include hypoxamhine. aminopterin. and diymidine ("HAT medium'), which substances prevem the growth of 
HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, suppon stable high level expression of 
antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. More 
preferred immortalized ceU lines are murine myeloma lines, which can be obtained, for instance, from tf,e Salk 
15 Institute Cell Distribution Center, San Diego. California and the American Type Culture Collection. Manassas. 
Virginia. Human myeloma and mouse-human heteromyeloma cell lines also have been described for die 
productionofhumanmonoclonalantiboditsnCozbor. LimffiSlDfiL. 122:3001 (1984); Brodeuretal.. Monoclnn.1 
Antibody Production Technigtms and AppHc.tion^ Marcel Dekker. Inc.. New York. (1987) pp. 51-63). 

The culture medium in which the hybridoma cells are cultured can dwn be assayed for die presence of 
20 monoclonal antibodies directed against PRO. Preferably, die binding specificity of monoclonal antibodies 
produced by the hybridoma cells is determined by immunoprecipitation or by an in wro binding assay, such as 
radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (EUSA). Such techniques and assays are 
known in die an. n« binding affinity of dte monoclonal antibody can. for example, be determined by die 
Scatchard analysis of Munson and PoUard, Anal. Biochen, i07:220 (1980). 
25 After die desired hybt^doma ceUs are identified, die clones may be subcloned by limiting dilution 

procedures and grown by standard medHxb [Coding. Suitable culture media for diis purpose include 

forexample. Dulbecco's Modified Eagle's Medium and RPMI- 1640 medium. Alternatively, die hybridoma cells 
may be grown in ww as ascites in a mammal. 

The monoclonal antibodies secreted by die subclones may be isolated or purified from die culmre 
30 medium or ascites fluid by conventional immunoglobulin purification procedures such as. for example, protein 
A-Sepharose. hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography. 

The monoclonal antibodies may also be made by recombinam DNA methods, such as diose described 
in U.S. Patem No. 4.816.567. DNA encoding die monoclonal antibodies of U« mvention can be readily isolated 
and sequenced using conventional procedures (e.g.. by using oligonucleotide probes diat are capable of binding 
35 specifically to genes encoding die heavy and light chains of murine antibodies). The hybridoma ceUs of die 
invention serve as a preferred source of such DNA. Once isolated, die DNA may be placed imo expression 
vectors, which are dientransfected into host cells such as simian COS cells. Chi««e hamster ovary(CHO)ceUs. 
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or myelonu, ceUs iha, do no, otherwise produce imnninoglobulin protein. .0 oteain ihe syndesis of monoclonal 
antibodies in the recombinant host cells. The DNA also may be modified, for example, by substimting .he 
coding sequence forhumanheavy and Ught chainconstantdomains inplaceof the homologous muri^ 
[U.S. Patent No. 4.816.567; Morrison e« al.. ^ or by covalemly joining to the immunoglobulin coding 

sequence all orpan of the coding sequence foranon-immunoglobulinpolypepude.Sucha.K,n^ 
polypeptide can be subsdtmedfortheconstaMdomaim of an antibody of d,einve«ion. or can be sute^ 
the variable domains of one antigen-combining site of an antibody of .he invemion 10 create a chimeric bivalent 

antibody. 

TT.e antibodies may be monovalent antibodies. Methods for preparing monovalem antibodies are well 

known in die ait. For example, one method involves recombinant expressionofimmunoglobulin light cb^ 
modified heavy chain. The heavy chain is uuncated generally at any poim in the Fc region so as .0 prevent 
heavy chain crosslinking. Alternatively. d« relevant cysteine residues axe substituted with another amino acid 
residue or are deleted so asm prevent crosslinking. 

In vino methods are also suiuble for preparing monovalem antibodies. Digestion of antibodies to 
produce fragmems tiie«of. particularly. Fab fragmems. can be accomplished using routine techniques known 
IS in die art. 



3 Human and Humanized Afitilwij>.c 

The anti-PRO antibodies of die mvention may further comprise humanized antibodies or human 
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antibodies. Humanized forms of non-human (e.g.. murine) antibodies are chimeric immunoglobulins 
immunoglobulin chains or fragments thereof (such as Fv. Fab. Fab'. F(ab'), or odier antigen-binding 
subsequences of antibodies) which contain minimal sequence derived from non-human immunoglobulin. 
Humanized amibodies include human immunoglobulins (recipient antibody) in which residues from a 
complemenary determining region (CDR) of the recipient are replaced by residues from a CDR of a non-human 
species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and capacity. In 

25 ^'neinsunces.FvframeworkresiduesofUiehumanimmunoglobulinarerepIacedbycorrespondingnon-human 
residues. Humanized antibodies may also comprise residues which are found neidier in die recipient antibody 
nor in die imported CDR or framework sequences. In general, die humanized antibody wUl comprise 
substamially all of at leas. one. and typically wo, variable domains, in which all or subsum.ially all of die CDR 
regions correspond to diose of a non-human immunoglobulin and all or subsum.ially all of die FR regions are 

30 diose of a human immunoglobulin consensus sequence. The humanized an.ibody optimally also will comprise 

at least a portion of an immmioglobulin constant region (Fc). typically diat of a human immmioglobulin fJones 

etal.. Nature. 221:522-525(1986); Riechmamietal.. fiiatfflS. 2^:323-329(1988); and Presu. Curr. On. ..m.c. 
SifiL, 2:593-596 (1992)J. 

Metfiods for humanizing non-human antibodies are weU known in die an. Generally, a humanized 
35 antibody has one or more amino acid residues introduced into it from a source which is non-bmnan. These non- 
human amino acid residues are often referred ,0 as -import" residues, which are typically uJcen from an -impon" 
variable domain. Humanization can be essentially performed following die meUiod of Winter and co-worken 
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[Jones et al.. Nanire. 321:522-525 (1986); Riechmaim et al.. Nature. 232:323-327 (1988); Verhoeyen et al.. 
Sasnce, 239:1534-1536 (1988)1. by subsUniiing rodent CDRs or CDR sequences for the conesponding 
sequences ofa human antibody. Accordingly, such 'hunianized" amibodies are chimeric antibodies (U.S. Patent 
No. 4.816,567). wherein substantially less than an intact human variable domain has been substinued by (he 
corresponding sequence from a non-human species. In practice, humanized antibodies are typically human 
5 antibodies in which some CDR residues and possibly some FR residues are substituted by residues from 
analogous sites in rodent antibodies. 

Human antibodies can also be produced usmg various techniques known in the an. including phage 
display libraries [Hoogenboom and Winter. J. Mol. Biol.. 227:381 (1991); Marks et al.. J. Mol. Bini 222:581 
(1991)). The techniques of Cole et al. and Boemer et al. are also available for ±e preparation of human 
10 monoclonal antibodies (Cole « al. . Monoclonal Anti hodies and Carrpr Th.r.py Alan R. Liss. p. 77 (1985) and 
Boemer et al.. J. Immunol. . J47ai:86-95 (1991)]. Similarly, human antibodies can be made by introducing 
of human immunoglobulin loci into transgenic animals, e.g.. mice in which the endogenous immunoglobulin 
genes have been partially or completely mactivated. Upon challenge, human antibody production is observed, 
which closely resembles tiiat seen in humans m all respects, including gene rearrangement, assembly, and 
15 antibody repenoire. This approach is described, for example, in U.S. Patem Nos. 5.545.807; 5.545.806; 
5.569.825; 5.625.126; 5.633.425; 5.661.016. and m the following sciemific publications: Marks et al., 
a!2fli£iinal2g3!.Jfl. 779-783 (1992); Lonberg etal.. Nafias 268856-859(1994); Morrison. Iiaturc26S, 812-13 
(1^); FishwUd etal.. Nature Biotechnolo|7 J4. 845-51 (1996); Neuberger. Nature Biotechn»lnav 826 
(1996); Lonberg and Huszar. Intern. Rev. Immunol. |3 fi-S-Q^ (1995). 

20 

4. Bispecific Antibodies 
Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens. In die present case, one of the binding specificities is for the 
PRO. the other one is for any ptiier antigen, and preferably for a cell-surface protein or receptor or receptor 
15 subunit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the recombinant 
production of bispecific antibodies is based on the co-expression of two immunoglobulin heivy-chain/lig^ 
pairs, where the two heavy chams have differem specificities [Milstein and Cuello. Nanire . 305 :537-539 { 1983)] . 
Because of the random assortmem of tanmunoglobulin heavy and light chains, these hybridomas (quadromas) 
0 produce a potemial mixmre of ten different antibody molecules, of which only one has die correct bispecific 
strucnire. The purification of the correct molecule is usually accomplished by affinity chromatography susps. 
Similar procedures are disclosed in WO 93/08829. published 13 May 1993. and in Traunecker et al.. EMBO 
L.ifl:3655-3659(1991). 

Antibody variable domains widi die desired bindmg specificities (antibody-antigen combining sites) can 
5 be fused to immunoglobulin constant domam sequem«s. The fiision preferably is with an immunoglobulin 
heavy-chain constam domain, comprising at least pan of die hinge. CH2. and CH3 regions. It U preferred to 
have die first heavy<hain constant region (CH 1) cpntaming die site necessary for light-cham binding presem. in 
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at least one of the fusions. DNAs encoding the immunoglobulin heavy-chain fusions and. if desired, the 
immunoglobulin light chain, are insened into separate expression vectore. and are co-transfected into a suitable 

hostorganism. For fimherdetaib of generating bispeciftc antibodies sec. for example. Sureshetal.. Method. 
m Enzvmoteev 121:210 (1986). 

According to another appioach described in WO 96/2701 1. the interface between a pair of antibody 
molecules can be engineered to maximize .he percentage of heterodimers which are recoveied from recombinant 
cell culture. The preferred interface comprises at least a pan of the CH3 region of an antibody constant domain. 
In this method, one or more small amino acid side chains from the interface of the first antibody molecule are 
replaced with larger side chains (e.g. tyrosine or tryptophan). Compensaton^ "cavities" of identical or similar 
size to the large side chain(s) are created on the interface of the second antibody molecule by replacing large 
amino acid side chains with smaller ones (e.g. alanine or threonine). This provides a mechanism for increasing 
the yield of the hetcrodimer over other unwanted end-products such as homodimers. 

Bispecific antibodies can be prepared as fiill length antibodies or antibody fragments (e.g. F(ab-), 
bispecific antibodies). Techniques for generating bispecific antibodies from antibody fragments have been 
described in the literamre. For example, bispecific antibodies can be prepared can be prepared usine chemical 
lii*age.Bremum«fl/..Scien^229:81(1985)describeaprocedure wherein intaa 

cleaved to generate F(ab'), fiagmems. These fragmems are reduced in the presence of the dithiol complexing 
agent sodium arsenite to stabilize vicin^Tdithiols and prevem imermolecular disulfide formation. The Fab- 
fragments generated are then convened to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB 
derivatives is then reconvened to the Fab'-duol by reduaion with mercaptoethylamine and is mixed with an 
equimolar amount of the od,er Fab'-TNB derivative to fonn the bispecific antibody. Tbc bispecific antibodies 
produced can be used as agents for the selective immobilization of enzymes. 

Fab- fragments may be directly recovered from E. coli and chemically coupled to fonn bispecific 
antibodies. Shalaby e, cl., J. Exp. Med. 175:217-225 (1992) describe the producUon of a fiJly humanized 
bispecific antibody F(ab'), molecule. Each Fab' fragmem was separately secreted from E. coli and subjected 
to directed chemical coupling in vitro to form the bispecific antibody. The bispecific anubody thus formed was 
able to bind to cells overexpressing the ErbB2 receptor and nonnal human T cells, as well as trigger the lytic 
activity of human cytotoxic lymphocytes against human breast mmor targets. 

Various technique for maldng and isolating bispecific antibody fragments directly from recombinant ceH 
culmie have also been described. For example, bispecific antibodies have been produced using leuci^^ 
Kostelny « d., JJnsmunoL 148(5): 1547-1553 (1992). TT,e leucine zipper peptides from die Fos and Jun 
pn>teins were linked to the Fab' ponions of two different antibodies by gene fusion. TTie antibody homodimers 
were reduced at the hinge region to fonn monomers and then renixidized to form the antibody heterodimers. 
This method can also be utilized for die production of antibody homodimers. The "diabody" technology 
described by Hollinger « al., Proc. Natl. Acad Sri !rSA 90:6444-6448 (1993) has provided an alternative 
mechanism for making bispecific antibody fragmems. The fragments comprise a heavy-chain variable domain 
(V„) comtecied to a light-chain variable domain (V,) by a linker which is too shon to allow pairing between die 
two domains on the same chain. Accordingly, die Vh and V, domains of one fragment are fon«l to pair witf, 
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ihe complementaiy and V„ domains of another iiagment. thereby fonning two anUgen-binding sites. Another 
strategy for maldng bispecific antibody fragments by the use of single-chain Fv (sFv) dimen has also been 
repoited. See. Gniber a al. , 1. Immunol. 152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific antibodies can be prepared. 
Tim « al. , J. Immunol. 147:60 (1991). 
5 Exemplaiy bispecific anubodies may bind to two different epitopes on a given PRO polypeptide herein. 

Alternatively, an anti-PRC polypeptide arm may be combined with an arm which binds to a triggering molecule 
on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CDS, CD28. or B7). or Fc receptors for IgG 
(FcYR). such as FcyRI (CD64). FcyRII (CD32) and FcyRIII (CD16) so as to focus cellular defense mechanisms 
to the cell expressing the particular PRO polypeptide. Bispecific antibodies may also be used to localize 
10 cytotoxic agents to cells which express a particular PRO polypeptide. These antibodies possess a PRO-binding 
arm and an arm which binds a cytotoxic agem or a radionuclide chelator, such as EOTUBE. DFTA. DOTA. 
or TETA. Another bispecific antibody of mterest binds the PRO polypeptide and ftmher binds tissue factor 
(TF). 

5- HeteroconiiiMte Anrihndie» 
Heteroconjugate antibodies are also witiun the scope of tiie present invemion. Heteroconjugate 
antibodies are composed of two covalentiy joined antibodies. Such antibodies have, for example, been proposed 
to target immune system ceUs to unwarned cells [U.S. Patent No. 4.676.980]. and for treatment of HIV infection 
IWO 91/00360; WO 92AJ00373; EP 03089]. It is contemplated tiiat the antibodies may be prepared in viiro 
20 using known metiiods in syntiietic protein chemistry, mcludmg diose involving crosslinkiQg agents. For 
example, immunotoxins may be constructed using a disulfide exchange reaction or by forming a tiiioedier bond. 
Examples of suitable reagents for this purpose include iminotiiiolate and methyl-4-meicaptobutyrimidate and 
diose disclosed, for example, in U.S. Patem No. 4.676.980. 

^ 6. Effector Function Engineering 

It may be desirable to modify the amibody of the invemion with respect to effector fimcuon, so as to 

enhance, e.g.. die effectiveness of tiic antibody in treating cancer. For example, cysteine rcsidue(s) may be 

introduced into die Fc region, thereby allowing interchain disulfide bond formation in this region. The 

homodimeric amibody thus generated may have improved internalization capability and/or increased 

complement-mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron etal.,L 

lffii<ed..J76: 1191-1195 (I992)andShopes. Onsmnol.. 148: 2918-2922(1992). Homodimeric antibodies 

with enhanced anU-tumor activity may also be prepared using heterobifunctional cross-linkers as described in 

Wolffe/a/. Cancer Rejjeareh 52:2560-2565(1993). Alternatively, an amibody can be engmeered that has dual 

Fcr^gionsandmaytherebyhaveenhancedcomplememlysisandADCCcapabllities. SeeStevenson ^ra/., Aiui; 
35 Cancer Drug T^ijrni, 3: 219-230 (1989). 
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7. Immunoconju^ates 
The invention also pertains to immunoconjugates comprismg an antibody conjugated to a cytotoxic agent 
such as a chemotherapeutic agent, toxin (e.g. . an enzymatically active toxin of bacterial, fungal, plant, or animal 
origin, or fragments ihereoO, or a radioactive isotope (i.e.. a radioconjugate). 

Chemotherapeutic agents useful in the generation of such unmunoconjugates have been describedabove. 
5 Enzymatically active toxins and fragments thereof that can be used mclude diphtheria A chain, nonbmding active 
fragments of diphtheria toxin, exoioxm A chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, 
modeccin A chain, alpha-sarcin. Aleuhies fordii proteins, dianihin proteins. Phytolaca americana proteins 
(PAPI. PAPII, and PAP-S), momordica charantia inhibitor, curcin, croiin, sapaonaria officinalis inhibitor, 
gelonin, mitogellin, resiriciocin, phenomycin, enomycin, and the tricoihecenes. A variety of radionuclides are 
10 available for the production of radioconjugated antibodies. Examples include '^^i, *^^In, and '"Re. 

Conjugates of the antibody and cytotoxic agent are made using a variety of bifimctional protein-coupling 
agents such as N-succinimidyI-3-(2-pyridyldithiol) propionate (SPDP), iminotiiiolane (IT), bifunctional 
derivatives of imidoesters (such as dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate). 
aldehydes (such as glutareldehyde). bis-azido compounds (such as bis (p-azidobenzoyi) hexanediamine). bis- 
15 diazonium derivatives (such as bis-(p^iazoniumbenzoyl>ethylenediaminc), diisocyanates (such as tolyene 2,6- 
diisocyanate), and bis-active fluorine compounds (such as 1 .5-dinuoro.2,4-dinitrobenzene). For example, a ricin 
immunotoxin can be prepared as described in Vitetta et al,. Science . 238: 1098 (1987). Carbon- 14-labeled 1- 
isothiocyanatobenzyI-3-methyldiethylene triaminepentaacetic acid (MX-DTPA) is an exemplary chelating agent 
for conjugation of radionucleoiide to the antibody. See W094/1 1026. 

In another embodiment, the antibody may be conjugated to a "receptor" (such streptavidin) for 
utilization in tumor pretargeting wherein the antibody-receptor conjugate is administered to the paticm, followed 
by removal of unbound conjugate from the circulation using a clearing agem and then admmistration of a 
"ligand" (e.g.. avidin) that is conjugated to a cytotoxic agent (e.g., a radionucleoiide). 

25 8. Immunoliposomes 

The antibodies disclosed herein may also be formulated as immunoliposomes. Liposomes containing 
the antibody are prepared by metiiods known in the an. such as described in Epstein et al., Proc. Natl. Acad 
S^' 36^^ (1985); Hwang et aL, Proc. Natl Acad. Sci. USA. 77: 4030 (1980); and U.S. Pat. Nos. 
4.485.045 and 4,544,545. Liposomes with enhanced circulation time are disclosed in U.S. Patent No. 
30 5,013,556. 

Particularly useful liposomes can be generated by the reverse-phase evaporation method with a lipid 
composition comprising phosphatidylchoiine.cholesierol.and PEG-derivatized phosphatidylethanolamine (PEG- 
PE). Liposomes are extruded through filters of defined pore size to yield liposomes with the desired diameter. 
Fab* fragments of the antibody of the present invention can be conjugated to the liposomes as described in Martin 
35 et al ■> J. Biol. Chem.. 257: 286-288 (1982) via a disulfide-interchange reaction. A chemotherapeutic agem 
(such as Doxorubicin) is optionally contained within the liposome. See Gabizon et at. , J. National Cancer Inst 
iL(19): 1484(1989). 
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^' Pharmaceuti cal Compositions of Antibodiejg 
Antibodies specifically binding a PRO polypeptide ideniifiedherein, as well as other molecules identified 
by the screening assays disclosed hereinbefore, can be administered for the treatment of various disorders in die 
form of phannaceutical con^oshions. 

If the PRO polypeptide is intracelhilar and whole antibodies aie used as inhibitois. intenializing 
5 antibodies are preferred. However, lipofections or liposomes can also be used to deliver the antibody, or an 
antibody fragment, into ceUs. Where antibody fragmems are used, the smaUest inhibitory fragment that 
specifically binds to the binding domain of the target protein is preferred. For example, based upon the variable- 
region sequences of an antibody, peptide molecules can be designed that retain the abUity to bind the target 
protein sequence. Such peptides can be synthesized chemically and/or produced by recombinant DNA 
10 technology. See. e.g.. Marasco et al.. Proc. Natl. Acarf <>ri vkx 90: 7889-7893 (1993). Ite finnuiaiion 
herein may also contain more than one active compound as necessary for the particular indication being treated, 
preferably those with complementary activities that do not adversely affect each other. Alternatively, or in 
addition, the composition may comprise an agem that enhances its function, such as. for example, a cytotoxic 
agent, cytokine, chemotiierapeutic agent, or growth-inhibitory agem. Such molecules are suitably presem in 
15 combination in amounts that art'effective for tiie purpose intended. 

The active ingredients may also be entrapped in microcapsules prepared, for example, by coacervation 
techniques or by imerfacial polymerization, for example, hydroxymeihylcellulose or gelaiin-miciocapsules and 
poly-(methylmeUiacylate) microcapsules, respectively, in colloidal drug delivery systems (for example, 
liposomes, albumin microspheres, microemulsions. nano-panides. and nanocapsules) or in macroemulsions. 
20 Such techniques are disclosed in Remington's Pharmaceutical ^x^r.^ supra. 

The formulations to be used for in Ww administration must be sterile. This is readily accomplished by 
filtration through sterile filtration membranes. 

Sustained-release preparations may be prepared. Suitable examples of sustained-release preparations 
include semipermeable matrices of solid hydrophobic polymers containing ti>e araibody. which matrices are in 
25 die form of shaped articles, e.g.. fihns. or microcapsules. Examples of sustained-release matrices include 
polyesters, hydrogels (for example. poly(2-hydroxyethyl-metiiacrylate). or poly(vinylalcohol)). polylactides 
(U.S. Pat. No. 3.773.919). copolymers of L-glutamic acid and y ethyl-L-glutamate. non-degradable ethylene- 
vinyl acetate, degradable lactic acid-glycolic acid copolymers such as die LUPRON DEPOT ™ (injectable 
microspheres composed of lactic acid-glycolic acid copolymer and leuprolide acetate), and poly-D-(-)-3- 
30 hydroxybutyric acid. While polymers such as ethylene-vinyl acetate and lactic acid-glycolic acid enable release 
of molecules for over 100 days, certain hydrogels release proteins for shorter time periods. When encapsulated 
antibodies remain in the body for a long time. Uiey may denanire or aggregate as a result of exposure to moisture 
at 37«C. resulting in a loss of biological activity and possible changes in immunogenicity. Rational strategies 
can be devised for stabilization depending on the mechanism im^olved. For example, if the aggreption 
inechanismisdiscovered to be imennolecularS-Sbondformation through UiioKlisulfideint^ 
may be achieved by modifying sulfhydiyl residues, lyophilizing from acidic solutions. comroUing moismre 
contem. using appropriate additives, and developing specific polymer matrix compositions. 
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Uses for ami- PRp Antihnriit.s 
The anU-PRO amibodies of the invention have various utilities. For example. anti-PRO antibodies may 
beuscdindiagnosticassaysforPRO. e.*.. detecting its expression in specific cells, tissues, or serum. Various 
diagnostic assay techniques known in the an may be used, such as competitive binding assays, direct or indirect 
sandwich assays and immunoprecipiution assays conducted in either heterogeneous or homogeneous phases 
^ M°"o«=to"al Anribodjp;: A M!.mial of T,Thnin„r. CRC Press. Inc. (1987)pp. 147-158]. TTie amibodies 

used in the diagnostic assays can be labeled with a detectable moiety, m dettctable moiety should be capable 
of producing, either direcUy or indirectiy. a detectable sigmti. For example, the deteaable moiety may be a 
radioisotope, such as ^H. "C. «P. «S. or -I. a fluorescem or chemiluminescent compound, such as fluorescein 
isothiocyanate. rhodamine. or luciferin. or an enzyme, such as alkaline phosphatase, beta-galactosidase or 
10 horseradish peroxidase. Any method known in the an for conjugating the antibody to the detectable moiety may 
be employed, including those methods described by Hunter et al.. Nature. J44:945 (1962); David ct al. 
Biochemisia, 11:1014 (1974); Painetal.. J. Immunol. Meth 40:219 (1981); and Nygren. J. Histochen. J 
Cvtochem.. 32:407 (1982). 

Ami-PRO antibodies also are usefiil for the affinity purification of PRO from recombinant cell culture ' 
15 or namral sources. In this process, the antibodies against PRO are immobilized on a suitable support, such a 
Sephadex resin or filter paper, using metiiods well known in the an. The immobilized antibody then is contacted 

widiasamplecomaining the PRO to be purified, and thereafter the suppon is washed withasuitable^^^^^^^ 
will remove substantially all the material in the sample except the PRO. which is bound to the immobilized 
amibody. Fmally. the suppon is washed with another suiuble solvem diat will release the PRO from the 
20 antibody. 

The following examples are offered for illustrative purposes only, and are not intended to limit the scope 
of the present invention in any way. 

All patemandliteramre referencescited inthe presem specificationarehereby incorporated by reference 
in their entirety. 
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EXAMPLES 

Commereially available reagents referred to in the examples were used according to manufacmrer's 
instnu:tions miless otherwise indicated. The source of those cells idemified in the following examples, and 
throughout the specification, by ATCC accession numbers is the American Type Culmre Collection. Manassas 
30 VA. 

g?^AMPlF. l fi^^trap^Hvlar nomain HomolnftV Screeninp tn in ^ ntitv Novel Pnlvpeptide. anri rONIA Fn^nHinf. 
Therefor 

TTie exiracelluter domain (BCD) sequences (including the secretion signal sequence, if any) from about 
35 950 known secreted proteins from the Swiss-Piot public database were used to seareh EST databases. The EST 
databases included public databases (e.g.. Dayhoff. GenBank). and proprietary databases (e.g. LIFESEQ™. 
Incyie Phannaceuticals. Palo Alto. CA). The seareh was performed using the computer program WU.BLAST.2 
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was sized to SOO-1000 bp, linkered with blunt to NoU adaptors, cleaved with SHI, and cloned into Sfil/NotI 
cleaved vecwr. pSST-ANfy .0 is a cloning vector that has a yeast alcohol dehydrogenase promoter preceding 
the cDNA cloning sites and the mouse amylase sequence (the manire sequence without the secretion signal) 
foHowed by the yeast alcohol dehydrogenase terminator, after the cloning sites. Thus. cDNAs cloned into this 
vector that aie fused in frame with amylase sequence will lead to die secieuon of amylase from appropriately 
transfected yeast colonies. 

3- Transfomation and Detection 

DNA from the libraiy described in paragraph 2 above was chilled on ice to which was added 
electrocompetcnt DHIOB bacteria (Life Technologies. 20 ml). The bacteria and vector mixnire was then 
electroporated as recommended by the manufacturer. Subsequently. SOC media (Ufe Technologies. 1 ml) was 
added and die mixture was incubated at 37«C for 30 minutes. The transformants were then plated onto 20 
standard 150 mm LB plates containing ampicillin and incubated for 16 hours (ZTQ. Positive colonies were 
scraped off the plates and the DNA was isolated from the bacterial pellet using standard protocols, e.g. CsCl- 
gradient. The (wrified DNA was then carried on to die yeast protocols below. 
15 The yeast mediods were divided- into diree categories: (1) Transformation of yeast with the 

plasmid/cDNA combined vector; (2) Detection and isolation of yeast clones secreting amylase; and (3) PCR 
amplification of die insert directly from die yeast colony and purification of the DNA for sequencing and ftrnher 
analysis. 

The yeast strain used was HD56.5A (ATCC-90785). This, strain has die following genotype: MAT 
20 alpha. ura3-52. leu2-3. Ieu2-112. his3-l 1. his3-15. MAL*. SUC*. GAL*. Preferably, yeast mutants can be 
employed Uiat have deficiem post-iranslational patiiways. Such mutants may have translocation deficient alleles 
in seel I, secll, sec62. widi truncated secll being most preferred. Alternatively, antagonists (including 
antisense nucleotides and/or ligands) which interfere widi die normal operation of diese genes, odier proteins 
implicated in diis post translation padiway (e.g., SEC61p. SE(r72p. SEC62p. SEC63p. TDJlp or SSAlp^p) 
25 or the complex formation of diese proteins may also be preferably employed in combination wiUi die amylase- 
expressing yeast. 

Transformation was performed based on die protocol outlined by Gietz ei al. , Nucl. Acid. Res 20: 1425 
(1992). Transformed cells were dien inoculated from agar into YEPD complex media brodi (100 ml) and grown 
overnight at 30»C. The YEPD brodi was prepared as described in Kaiser et al., Mediods in Yea« r.^n,..ir. 
Cold Spring Harbor Press. Cold Spring Harbor. NY. p. 207 (1994). The overnight culmre was dien diluted to 
about 2 X 10* ceUs/ml (approx. OD«o=0.1) into fresh YEPD broUi (500 ml) and regrown to 1 x 10 ' cells/ml 
(approx. OD4oo=0.4^).5). 

The cells were dien harvested and prepared for transfoniiation by transfer into GS3 rotor botUes m a 
Sorval GS3 rotor at 5.000 rpm for 5 minutes, die supernatant discarded, and dien resuspended imo sterile water. 
35 and cerarifijged again in 50 ml frJcon tubes at 3.500 rpm in a Beckman GS-6KR cemriftige. The supernatant 
was discarded and die cells were subsequently washed widi LiAcTTE (10 ml. 10 mM Tris-HO. I mM EDTA 
pH 7.5. 100 mM LijCXJCCH,). and resuspended imo LiAc/TE (2.5 ml). 
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The underlined regions of the oligonucleotides annealed to the ADH promoter region and the amylase 
region, respectively, and amplified a 307 bp region from vector pSST-AMY.O when no insert was present. 
15 Typically, the first 18 nucleotides of the 5' end of these oligonucleotides contained am.ealing sites for the 
sequencing primers. Thus, the total product of the PCR reaction from an empty vector was 343 bp. However, 
signal sequence-fused cDNA resulted in considerably longer nucleotide sequences. 

FoUowing the PCR. an aliquot of die reaction (5 ^1) was examined by agarose gel electrophoresis in 
a 1 % agarose gel using a Tris-Borate-EDTA (TBE) buffering system as described by Sambrook et al.. suEia, 
Clones resulting in a single strong PCR product larger than 400 bp were ftmher analyzed by DNA sequencing 
after purification with a 96 Qiaquick PCR clean-up column (Qiagen Inc.. Chatsworth. CA). 
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EXAMPLE 3 : Isolation of cDNA ri»n e s Using 5Sit>nal Algorithm Analy^j ^ 

Various polypeptide^ncoding nucleic acid sequences were identified by applying a pioprietaiy signal 
sequence finding algorithm developed by Genentech, Inc. (South San Francisco. CA) upon ESTs as weU as 
dusteied and assembled EST fragments from public (e.g.. GenBank) and/or private (UFESEQ*. Incyte 
Pharmaceuticals. Inc.. Palo Alto. CA) daubases. The signal sequence algorithm computes a secretion signal 
score based on the character of the DNA nucleotides surrounding the first and optionally the second methionine 
codon(s) (ATG) at the 5'-end of the sequence or sequence fragmem under consideration. The nucleotides 
30 following the first ATG must code for at least 35 unambiguous amino acids without any stop codons. If the first 
ATG has the required amino acids, the second is not examined. If neidier meets the requirement, the candidate 
sequence is not scored. In order to determine whedier the EST sequence contains an authemic signal sequence, 
the DNA and corresponding amino acid sequences surrounding the ATG codon are scored using a set of seven 
sensors (evaluation parameters) known to be associated with secreuon signals. Use of this algorithm resulted 
35 in the identification of numerous polypeptide-encoding nucleic acid sequences. 

EXAMPLE 4: Isolation of c DNA clones Encoding Human PR0281 

In order to obtain a cDNA clone encoding PR028I. methods described m Klein et al.. Proc. Natl. 
Acad. Sci. USA 93:7108-71 13 (1996) were employed widi the following modifications. Yeast transformation 
was performed with limiting amounts of transforming DNA in order to reduce the number of multiple 
transformed yeast cells. Instead of plasmid isolation from the yeast followed by transformation of E. coU as 
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described in Klein et al. . suEra. PCR analysis was performed on single yeast colonies. PCR primers employed 
were bipartite in order to amplify the insert and a small portion of the invenase gene (allowing to determine that 
the insen was in frame with invenase) and to add on universal sequencing primer sites. 

An invertase library was transformed into yeast and posiUves were selected on sucrose plates. Positive 
clones were re-tested and PCR products were sequenced. The sequence of one clone, PR0281 . was determined 
to comain a signal peptide coding sequence. Oligonucleotide primers and probes were designed using the 
m«:leotide sequence of PR0281. A fuU length plasmid library of cDNAs from human umbilical vein 
endothelium tissue was titeied and approximately 100.000 cfti were plated in 192 pools of 500 cfu/pool into 96- 
well round bottom pUtes. The plates were sealed and pools were grown overnight at 37^ with shaking 
(200rpm). PCR was performed on the individual cultures using primers. Agarose gel electrophoresis was 
performed and positive wells were identified by visualization of a band of the expected size. Individual positive 
clones were obtained by colony lift foUowed by hybridization with «P-labeled oligomicleotide. These clones 
were characterized by PCR. restriction digest, and southern blot analyses. 

A fiiU length clone was idemified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 80-82. and a stop signal at nucleotide positions 1115-1117 
15 (Figure 1. SEQ ID NO:l). The predicted, polypeptide precursor is 345 amino acids long, has a calculated 
molecular weight of approximately 37.205 daltons and an estimated pi of approximately 10. 15. Analysis of die 
full-length PR0281 sequence shown in Figure 2 (SEQ ID N0:2) evidences die presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 14. multiple transmembrane domains from about 
amino acid position 83 to about amino acid position 105. from about amino acid position 126 to about amino acid 
20 position 146. from about amino acid position 158 to about amino acid position 177. from about amino acid 
position 197 to about amino acid position 216. from about amino acid position 218 to about amino acid position 
238. from about amino acid position 245 to about amino acid position 265. and from about amino acid position 
271 to about amino acid position 290 and an amino acid sequence block having homology to G-proiein coupled 
receptor proteins from about amino acid 1 15 to about amino acid 155. Clone UNQ244 (DNA16422-1209) has 
25 been deposited widi ATCC on June 2, 1998 and is assigned ATCC deposit no. 209929. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35). using a WU-BLAST-2 sequence 
aligmnem analysis of die full-length sequence shown in Figure 2 (SEQ ID N0:2), evidenced significant 
homology between die PR0281 amino acid sequence and die following Dayhoff sequences: H64634. 
AF033095_1. B64815. YBHL.ECOU. EMEQUTRJ. AF064763 3. S53708. A69253. AF035413 12 and 
30 S63281. 

EXAMPLE 5: Isolation of c DNA clones Fncodine Human PR0276 

In order to obtain a cDNA clone encoding PR0276. mediods described m Klein etal.. PNAS. 21:7108- 
71 13 (1996) were employed witi, die following modifications. Yeast transformation was performed witfi limiting 
35 amounts of transforming DNA in order to reduce die number of multiple transformed yeast cells. Instead of 
plasmid isolation from die yeast followed by transformation of E. coU as described in Klein et al.. sum. PCR 
analysis was performed on single yeast colonies. PCR primers employed were bipartite in order to amplify die 
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insenaada small portion of the invertasc gene (allowing lo determine that the insen was in frame with invertase) 
and to add on universal sequencing primer sites. 

An mvenase library was transformed imo yeast and positives were selected on sucrose plates. Positive 
clones were re-tested and PCR products were sequenced. The sequence of one clone. PR0276. was determined 
to contain a signal peptide coding sequence. Oligonucleotide primers and probes were designed using the 
5 nucleotide sequence of PR0276. A full length plasmid library of cDN As from human fetal liver cells was titered 
and approximately 100.000 cfu were plated in 192 pools ofSOOcfu/pool into 9^well round bottom plates. The 
plates were sealed and pools were grown overnight at 37 C with shaking (200rpm). PCR was performed on the 
individual culmres using primers . Agarose gel electrophoresis was performed and positive wells were identified 
by visualization of a band of the expected size. Individual positive clones were obtained by colony lift followed 
10 by hybridization with «P-labeled oligom-cleotide. These clones were characterized by PCR. restriction digest, 
and southern biot analyses. 

A full length clone was identified diat contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 180-182 and a stop signal at nucleotide positions 933-935 
(Figure 3: SEQ ID N0:5). The predicted polypeptide precursor is 251 amino acids long has a calculated 
15 molecular weight of approximately 28.801 daltons and an estimated pi of approximately 9.58. TT,e 
transmembrane domains are approximately at amino acids 98^1 16 and 152-172 of the sequence shown in Figure 
4 (SEQ ID N0:6). Clone DNA16435.1208 (UNQ243) has been deposited with the ATCC and is assigned 
ATCC deposit no. 209930 . 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignmem analysis of the frill-length sequence shown in Figure 4 (SEQ ID N0:6). revealed some sequence 
identity between the PR0276 amino acid sequence and the following Dayhoff sequences: CEG25D7 2. 
ATT805_2. S69696. GRHR.RAT. NPCBAABCD_3. AB013149_1. P_R85942 and AP000006_5. 

EXAMPLE 6 : Isolation of m N A cinms Fnrodine Human PPniBo 

A clone designated herein as DNA14187 was isolated as described in Example 2 above from a human 
retina tissue library. The DNA14 187 sequence is shown in Figure 7 (SEQ ID N0:9). Based on the DNA 14 187 
sequence shown in Figure 7 (SEQ ID N0:9). oligomurleotides were synthesized: 1) to identify by PCR a cDNA 
library that comained the sequence of interest, and 2) for use as probes to isolateacloneofthe 
sequence for PR0189. Forward and reverse PCR primers generally range from 20 to 30 nucleotides and are 
30 oftendesignedtogiveaPCRpioductofaboutlOO-lOOObpinlength. The probe sequences are typically 40-55 
bp in length. In order to screen several libraries for a full-length clone. DNA from die libraries was screened 
by PCR amplification, as per Ausubel et al.. Current Protocol, i.. Molecular Rinioov with the PCR primer pair. 
A positive library was tiien used to isolate clones encoding the gene of interest using tiie probe oligonucleotide 
and one of the primer pairs. 
35 A pair of PCR primers (forward and reverse) were synthesized: 

fpTwardPCRprimrr 5'-TTGACCTATACAGAGATTCATC-3- (SEQ ID NO: 10); and 
reverse PCR primer 5--CTAAGAACTTCCCTCAGGAnTT-3' (SEQ ID NO: 11). 
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Addidonally, a synthetic oligonucleotide hybridization probe was constructed from the DNA14187 sequence 
which had die following nucleotide sequence: 
hybridization probe 

5'-ATGAAGATCAATTTCAAGAAGCATGCACTTCTCCTCriTGC.3' (SEQ ID NO: 12). 

In order lo screen several libraries for a source of a full-length clone, DNA from the libraries was 
5 screened by PGR amplification with the PGR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO 189 gene using the probe oligonucleotide and one of the PGR primers. 

RN A for construction of the cDN A libraries was isolated from human retina tissue (LIB94). The cDNA 
libraries used to isolate the cDNA clones were constructed by standard methods using commercially available 
reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with oligo dT containing a NotI 
10 site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel 
electrophoresis, and cloned in a defined orientauon into a suitable cloning vector (such as pRKB or pRKD; 
pRK5B is a precursor of pRKSD that does not contain the Sfil site; sec. Holmes et al., Science , 253: 1278-1280 
(1991)) in the unique Xhol and NotI sites. 

DNA sequencing of the clones isolated as described above gave die fiill-lengtfi DNA sequence for 
15 PR0189 and die derived protein sequence for PROI89. 

The entire nucleotide sequence of DNA21624-1391 \s shown in Figure 5 (SEQ ID NO:7). Clone 
DNA2 1624- 1391 contains a single open reading frame with an apparent translaiional initiation site at nucleotide 
positions 2(X)-202 and ending at the stop codon at nucleotide positions 1301-1303 (Figure 5). The predicted 
polypeptide precursor is 367 amino acids long (Figure 6). The full-length PRO 189 protein shown in Figure 6 
20 has an estimated molecular weight of about 41,871 daltons and a pi of about 5.06. Clone DNA21624-1391 has 
been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains die 
correct sequence, and die sequences provided herein are based on known sequencing techniques. 

Analyzing die amino acid sequence of SEQ ID N0:8, die puutive N-glycosylation sites are at about 
amino acids 224-227, 246-249 and 285-288. A domain for cytosolic fatty-acid binding proteins is at amino acids 
25 78-107 of SEQ ID NO:8. The corresponding nucleotides can be routinely determined given die sequences 
provided herein. 

Some sequence identity was found to W01A6. 1 and F35D1 1 . 1 1 , C. Elegans proteins, designated in a 
Dayhoff database as CEWOI A6_10 and CELF35D1 1,1 1 , respectively. Some sequence identity was also found 
to an antigen to malaria and to restin, designated in a Dayhoff database as P_R05766 and AF014012_1, 
30 respectively. Some sequence identity was also found to a micrombule binding protein and to myosin, designated 
in a Dayhoff database as AF041382_1 and S07537. respectively. There is also some sequence identity widi 1- 
phosphatidylinositol-4, 5-bispho5phate, designated as PIPl^RAT. 

EXAMPLE 7 : Isolation of cDNA clones Encoding Human PRO190 
35 A clone designated herein as DNA 14232 was isolated as described in Example 2 above from a human 

fetal retina tissue library. The DNAI4232 sequence is shown in Figure 10 (SEQ ID NO: 15). Based on die 
DNA14232 sequence, oligonucleotides were symhesized: 1) to identify by PCR a cDNA library dial contained 
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the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO 190. Fonvard and reverse PGR primers generally range from 20 to 30 nucleotides and are often designed 
to give a PGR product of about 100-1000 bp in length. The probe sequences are typically 40-55 bp in length. 
In order to screen several libraries for a fiill-length clone, DNA from the libraries was screened by PGR 
amplification, as per Ausubel et al.. Current Protocols in Molecular Biology, with the PGR primer pair. A 
5 positive library was then used to isolate clones encoding the gene of interest using the probe oligonucleotide and 
one of the primer pairs. 

A pair of PGR primers (forward and reverse) were synthesized: 
forward PGR primer 5'-GTATAGGTAGTGTAGGTTGT-3* (SEQ ID NO: 16); and 
reverse PGR primer 5'-TGAGAGAATTGGTTGGAGGA-3' (SEQ ID NO: 17). 
10 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA14232 sequence 
which had the following nucleotide sequence: 
hybridization probe 

5 -ACAGTGGTGTAGTGATGGTGTAATATGGTGGTTGTGAAGA-3' (SEQ ID NO: 18). 

In order to screen several libraries for a source of a full-lengih clone, DNA from the libraries was 
15 screened by PGR amplification^with the PGR primer pair identified above. A positive library was then used to 
isolate clones encoding the PRO190 gene using the probe oligonucleotide and one of the PGR primers. 

RN A for construction of the cDNA libraries was isolated from human retina tissue (LIB94). The cDN A 
libraries used to isolate the cDNA clones were constructed by standard methods using commercially available 
reagents such as those from Invitrogen, San Diego, GA, The cDN A was primed with oligo dT containing a Noil 
20 site, linked with blunt to Sail hemikinased adaptors, cleaved with Notl. sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; 
pRK5B is a precursor of pRK5D that does not contain the Sfil site; see. Holmes et al. , Science . 253: 1278-1280 
(1991)) m the unique Xhol and Notl sites. 

DNA sequencing of the clones isolated as described above gave sequences which include the fiill -length 
25 DNA sequence for PRO190 [herein designated as DNA23334- 13921 (SEQ ID NO: 13) and the derived protein 
sequence for PRO190. 

The entire nucleotide sequence of DNA23334-1392 is shown in Figure 8 (SEQ ID NO: 1 3). Glone 
DNA23334-1392 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 193-195 and which ends at the stop codon at nucleotide positions 1465-1467 (Figure 8). The predicted 

30 polypeptide precursor is 424 amino acids long (Figure 9). The fiill-length PRO190 protein shown in Figure 9 
has an estimated molecular weight of about 48,500 daltons and a pl of about 8.65. Glone DNA23334-1392 has 
been deposited with the ATGG. Regarding the sequence, it is understood that the deposited clone contains the 
correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO: 14, the putative transmembrane domains are at about 

35 ammo acids 16-36, 50-74, 147-168, 229-250, 271-293, 298-318 and 328-368 of SEQ ID NO: 14. N- 
glycosylaiion sites are at about amino acids 128-131, 204-207, 218-221 and 274-377 of SEQ ID NO:14. The 
corresponding nucleotides can be routinely determined given the sequences provided herein. 
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PRO190 has sequence identity with at least the following Dayhoff sequences designated as: 
CEZK896^2, JC5023, GMSl JCHPO and S44668. 

EXAMPLE 8 : Isolation of cDNA clones Encoding Human PR0341 

A clone designated herein as DNA 12920 was isolated as described in Example 2 above from a human 
5 placenta tissue library. The DNA12920 sequence is shown in Figure 13 (SEQ ID N0:2I). The DNA12920 
sequence was then compared to various EST databases including public EST databases (e.g., GenBank), and a 
proprietary EST database {LIFESEQ®» Incyte Pharmaceuticals, Palo Alto, CA) to identify homologous EST 
sequences. The comparison was performed using the computer program BLAST or BLAST2 (Altschul et al.. 
Methods in Enzvmology. 266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in 

10 some cases, 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phr^" (Phil Green, University of Washington, Seattle, Washington). This 
consensus sequence is herein designated DNA25314. Oligonucleotide primers based upon the DNA253I4 
sequence were then synthesized and employed to screen a human placenta cDNA library which resulted in the 
identification of the DNA26288-1239 clone shown in Figure 1 1 . The cloning vector was pRK5B (pRK5B is a 

15 precursor of pRK5D that does not contain the Sfil site; see, Holmes et al.. Science . 252*. 1278-1280 (1991)), and 
the cDNA size cut was less than 2800 bp. 

A full length clone was identified that contained a single open reading frame with an apparent 
iranslational initiation site ai nucleotide positions 380-382, and a stop signal at nucleotide positions 1754-1756 
(Figure II, SEQ ID NO: 19). The predicted polypeptide precursor is 458 amino acids long, has a calculated 

20 molecular weight of approximately 50,264 daltons and an estimated pi of approximately 8. 17, Analysis of the 
fiill-lengdi PR0341 sequence shown in Figure 12 (SEQ ID NO:20) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 17, transmembrane domains from about amino acid 
171 to about amino acid 190, from about amino acid 220 to about amino acid 239, from about amino acid 259 
to about amino acid 275, from about amino acid 286 to about amino acid 305, from about amino acid 316 to 

25 about amino acid 335, from about amino acid 353 to about amino acid 378 and from about amino acid 396 to 
about amino acid 417 and potemial N-giycosylation sites from about amino acid 145 to about amino acid 147 
and from about amim) acid 155 to about amino acid 158. Clone DNA26288-1239 has been deposited with 
ATCC on April 21, 1998 and is assigned ATCC deposit no. 209792. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 

30 alignment analysis of the full-length sequence shown in Figure 12 (SEQ ID NO:20), evidenced homology 
between the PR0341 amino acid sequence and the following Dayhoff sequences: S75696, H69788, D69852, 
A69888, B64918, F64752, LPU89276_1, G64962, S52977 and S44253, 

EXAMPLE 9: Isolation pf cDNA clones Encoding Huynan P|^O?80 
35 A clone designated herein as DNA12922 was isolated as described in Example 2 alK)ve from a human 

placenta tissue library. The DNA12922 sequence is shown in Figure 16 (SEQ ID NO:24). The DNA12922 
sequence was then con^ared to various EST databases including public EST databases (e.g., GenBank), and a 
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proprietaiy EST database (LIFESEQ®, Incyie Pharmaceuticals, Palo Alto, CA) lo ideiuify homologous EST 
sequences. The comparison was performed usmg the computer program BLAST or BLAST2 (Alischul ct ai.. 
Methods in Enzvmologv, 266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in 
some cases, 90) or greater thai did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Kiil Green. University of Washington, Seattle, Washington). 
5 An oligonucleotide probe was formed based upon the consensus sequence obtained above. This probe 

had the following sequence. 

5*-ACCTGTTAGAAATGTGGTGGTTTCAGCAAGGCCTCAGTTT (SEQ ID NO:25). 
This probe was used to screen a human placenta library prepared as described in paragraph 1 of Example 2 
above. The cloning vector was pRK5B (pRK5B is a precursor of pRK5D that does not contain the Sfil site; see, 
10 Holmes ei al.. Science . 253:1278-1280 (1991)); and the cDNA size cut was less than 2800 bp. A clone 
designated herein as DNA26843-1389 was obtained. 

The entire nucleotide sequence of DNA26843-1389 is shown in Figure 14 (SEQ ID NO:22). Clone 
DNA26843-1389 comains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 121-123 and ending at the stop codon at nucleotide positions 919-921 (Figure 14). The predicted 
15 polypeptide precursor is 266 amino acids long (Figure 15). The full-length PRO180 protein shown in Figure 
15 has an estimated molecular weight of about 29,766 daltons and a pi of about 8.39. Clone DNA26843-1389 
has been deposited wiUi the ATCC. Regarding the sequence, it is understood that the deposited clone comains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

StUl analyzing the amino acid sequence of SEQ ID NO:23. the transmembrane domains are at about 
20 amino acids 13-33 (type II). 54-73, 94-1 13, 160-180 and 122-141 of SEQ ID NO:23. N-myristoylaiion sites 
are at abom amino acids 57-62, 95-100. 99-104. 124-129 and 183-188 of SEQ ID NO:23. The corresponding 
nucleotides can be routinely determined given the sequences provided herein. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35). using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 15 (SEQ ID NO:23). evidenced some sequence 
25 identity between the PRO180 amino acid sequence and the following Dayhoff sequences: CEC33All_2. 
CEG11E6 5, CELW03A5_1 AND PEU83861_2 (NADH dehydrogenase subunit 4L. mitochondrion). 

EXAMPLE 10: Isolation of cDNA clones Encoding Hu man PRO 194 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

30 in Example I above. This consensus sequence is herein DNA19464. Based on die DNA19464 consensus 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 
of iniercsi, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0194. PCR 
primers (forward and reverse) were synthesized based upon the DNA19464 sequence. Additionally, a synthetic 
oligonucleotide hybridization probe was constructed from the consensus DNA19464 sequence. 

35 In order to screen several libraries for a source of a fiilWengdi clone, DNA from the libraries was 

screeited by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0194 gene using die probe oligonucleotide and one of the PCR primers. RNA 
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for constntction of the cDNA libraries was isolated from human fetal lung tissue (LIB25). 

DNA sequencing of the clones isolated as described above gave the fiilMength DNA sequence for 
PR0194 [herein designated as DNA26844-1394] (SEQ ID NO:27) and the derived protein sequence for 
PR0194. 

The entire nucleotide sequence of DNA26844-1394 is shown in Figure 17 (SEQ ID NO:27). Clone 
5 DNA26844-1394 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 81-83 and ending at the stop codon at nucleotide positions 873-875 (Figure 17). The predicted 
polypeptide precursor is 264 amino acids long (Figure 18). The full-length PR0194 protein shown in Figure 
18 has an estimated molecular weight of about 29,665 daltons and a pi of about 9.34. Analysis of the full-length 
PR0194 sequence shown in Figure 18 (SEQ ID NO:28) evidences the presence of various important 
10 polypeptides domains as shown in Figure 18. Clone DNA26844-1394 has been deposited with ATCC on June 
2, 1998 and is assigned ATCC deposit no. 209926. 

Analysis of the amino acid sequence of the full-length PRO 194 polypeptide suggests that it does not 
exhibit significant sequence similarity to any known human protein. However, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some homology between the PR0194 amino acid sequence and 
15 the following Dayhoff sequences. HUMORFT^l. CET07F10J, ATFCA9,I2, F64934, YDJX^ECOLl, 
ATAF00065719F29G20.19, H70002. S76980. H64934 and S76385. 

EXAMPLE 11: Isolation of cDNA clones Encoding Human PRO203 

A clone designated herein as DNA 156 18 was isolated as described in Example 2 above from a human 

20 fetal lung tissue library. The DNA15618 sequence is shown in Figure 21 (SEQ ID N0:31). Oligonucleotide 
probes were generated from the sequence of the DNA 1561 8 molecule and were used to screen a human fetal lung 
library (LIB26) prepared as described in paragraph 1 of Example 2 above. The cloning vector was pRK5B 
(pRK5B is a precursor of pRK5D that does not contain the Sfil site: see. Holmes etal.. Science. 253:1278-1280 
(1991)), and the cDNA size cut was less than 2800 bp. 

25 A fiill length clone was identified that contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 159-161 and ending at the stop codon found at nucleotide 
positions 12(X)-1202 (Figure 19; SEQ ID NO:29). The predicted polypeptide precursor is 347 amino acids long, 
has a calculated molecular weight of approximately 39,870 daltons and an esiunated pi of approximately 6.76. 
Analysis of the full-length PRO203 sequence shown in Figure 20 (SEQ ID NO:30) evidences the presence of 

30 the following: a type II transmembrane domain at about amino acid 64 to about amino acid 87; possible N- 
glycosylaiion sites at about amino acid 147 to about amino acid 150, about amino acid 155 to about amino acid 
158, and about amino acid 237 to about amino acid 240; sequence identity with heavy-metal-associaied domain 
proteins at about amino acid 23 to about amino acid 45, and sequence identity with D-isomer specific 2- 
hydroxyacid dehydrogenase at about amino acid 24 to about amino acid 34. Clone DNA30862-1396 was 

35 deposited with the ATCC on June 2, 1998, and is assigned ATCC deposit no. 209920. 

Analysis of the amino acid sequence of the full-length PRO203 polypeptide suggests that it possesses 
sequence similarity to GST ATPase, thereby indicating that PRO203 may be a novel GST ATPase. More 
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specifically, an anal3fsis of the DayhofT database (version 35.45 SwissProt 35) evidenced homology between the 
PRO203 amino acid sequence and the following Dayhoff sequences, AF008I24_1. CFRCDIGEN_1. and 
P_R82566. 

EXAMPLE 12: Isolation of cDNA clones Encoding Human PRO290 
5 An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals. Palo Alio. CA) 

was searched and an EST was identified that had homology to beige and FAN. An oligonucleotide probe based 
upon the identified EST sequence was then synthesized and used to screen human fetal kidney cDNA libraries 
in an attempt to identify a full-length cDNA clone. The oligonucleotide probe had the following sequence: 
5* TGACTGCACTACCCCGTGGCAAGCTGTTGAGCCAGCrCAGCrrG 3* (SEQ ID NO:34). 

10 RNA for construction of cDNA libraries was isolated from human fetal kidney tissue. The cDNA 

libraries used to isolate the cDN A clones encoding himian PRO290 were constructed by standard methods using 
commercially available reagents such as those from Invitrogen. San Diego. CA. The cDNA was primed with 
oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved widi Not!, sized 
appropriately by gel electrophoresis, and cloned in a defmed orientation into a suitable cloning vector (such as 

IS pRKB or pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see. Holmes et al.. Science 
253:1278-1280 (1991)) in the unique Xhol and Nod. 

A cDNA clone was identified and sequenced in entirely. The entire nucleotide sequence of DN A35680- 
1212 is shown in Figure 22 (SEQ ID NO:32). Clone DN A35680- 1212 contains a single open reading frame with 
an apparent translational initiation site at nucleotide positions 293-295. and a stop codon at nucleotide positions 

20 3302-3304 (Figure 22; SEQ ID NO:32). The predicted polypcpiide precursor is 1003 amino acids long. 

It is currently believed that the PRO290 polypeptide is related to FAN and/or beige. Clone DNA35680- 
1212 has been deposited with ATCC and is assigned ATCC deposit no. 209790. It is understood that the 
deposited clone has the actual correct sequence rather than the representations provided herein. The lull-length 
PRO290 protein shown in Figure 23 has an estimated molecular weight of about 1 12,013 daltons and a pi of 

25 about 6.4. 

EXAMPLE 13: Isolation of cDNA Clones Encoding Human PR0874 

A consensus DNA sequence designated herein as DNA36459 was identified using phrap as described 

in Example 1 above. Based on the DNA36459 consensus sequence, oligonucleotides were synthesized: 1) to 
30 identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a 

clone of the coding sequence for PR0874. 

PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5'-TCGTGCCCAGGG(}CTGATGTGC-3' (SEQ ID NO:37); and 

reverse PCR primer 5*-GTCnTTACCCAGCCCCGGGATGCG-3' (SEQ ID NO:38). 
35 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA36459 

sequence which had the following nucleotide sequence: 
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hybridization probe 

S ^jGCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCT (SEQ id N0:39). 

In order lo screen several libraries for a source of a clone, DNA from ihe libraries was screened by 
PGR amplification with the PGR primer pair identified above. A positive library was then used to isolate clones 
encoding the PR0874 gene using the probe oligonucleotide and one of the PGR primers. RNA for construction 
S of the cDNA libraries was isolated from human fetal lung tissue (LIB25). 

DNA sequencing of the clones isolated as described above gave the DNA sequence for PR0874 [herein 
designated as DNA40621-1440] (SEQ ID NO:35) and the derived protein sequence for PR0874. 

The entire nucleotide sequence of DNA40621-1440 is shown in Figure 24 (SEQ ID NO:35). Clone 
DNA4062 1-1440 contains a single open reading frame ending at the stop codon at nucleotide positions 964-966 
10 (Figure 24). The predicted polypeptide encoded by DNA4062 M440 is 321 amino acids long (Figure 25). The 
PR0874 protein shown in Figure 25 has an estimated molecular weight of about 36, 194 daltons and a pi of about 
9.85. Analysis of the PR0874 sequence shown in Figure 25 (SEQ ID NO:36) evidenced the presence of the 
following: a type II transmembrane domain at about amino acids 57-80; additional transmembrane domains at 
about amino acids 1 10-126. 215-231, and 254-274; potential N-g]ycosylation sites at about amino acids 16-19, 
15 27-30, and 289-292; sequence identity with hypothetical YBR002c family proteins at about amino acids 276-287; 
and sequence identity with ammonium transponer proteins ai about amino acids 204-230. Clone DNA40621- 
1440 was deposited with the ATCC on June 2, 1998, and is assigned ATCC deposit no. 209922. 

Analysis of the amino acid sequence of the PR0874 polypeptide suggests that it is a novel multi-span 
transmembrane protein. However, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
20 sequence identity between the PR0874 amino acid sequence and the following Dayhoff sequences: S67049, 
AF054839^1. S73437, S52460, and HIVU80570^1. 

EXAMPLE 14: Isolation of cDNA Clones Encoding Human PRO710 

A yeast screening assay was employed to identify cDN A clones that encoded potential secreted proteins . 
25 Use of this yeast screening assay allowed identification of a single cDNA clone whose sequence (herein 

designated as DNA38190) is shown in Figure 28 (SEQ ID NO:42). Based on the DNA38190 sequence shown 

in Figure 28, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the 

sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PRO710. 

In order to screen several libraries for a full-length clone, DNA from the libraries was screened by PCR 
30 amplification, as per Ausubel et al.. Current Protocols in Molecular Biology, with the PCR primer pair. A 

positive library was then used to isolate clones encoding the gene of interest using the probe oligonucleotide and 

one of the primer pairs. 

PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5*-TTCC(K:AAAGAGTTCTACGAGGTC}G-3' (SEQ ID NO:43) 
35 reverse PCR primer 5 -ATTGACAACATTGACTGGCCTATGGG-3' (SEQ ID NO:44) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA38190 sequence 

which had the following nucleotide sequence 
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hybridization probe 

5'-GTGGATGCTCTGTGTGCGTGCAAGATCCrTCAGGCCTTGTTCCAGTGTGA-3' (SEQ ID NO:45) 

In order to screen several libraries for a source of a fiill-length clone. DNA from the libraries was 
screened by PGR amplification with the PGR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0710 gene using the probe oligonucleotide and one of the PGR primers. 
5 RNA for construction of the cDN A libraries was isolated from human fetal kidney tissue (L1B227). 

The cDNA libraries used to isolate the cDNA clones were constructed by standard methods using commercially 
available reagents such as those from Inviirogen, San Diego, CA. The cDNA was primed with oligo dT 
containing a Notl site, linked with blimt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately 
by gel electrophoresis, and cloned in a defmed orientation into a suitable cloning vector (such as pRKB or 

10 pRKD: pRK5B is a precursor of pRK5D that does not contain the Sfil site; see. Holmes et al.. Science . 
253:1278-1280 (1991)) in the unique Xhol and Notl sites. 

A fiill length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 67-69 and ending at the stop codon found at nucleotide positions 
1765-1767 (Figure 26, SEQ ID NO:40). The predicted polypeptide precursor is 566 amino acids long, has a 

IS calculated molecular weight of approximately 65,555 daltons and an estimated pi of approximately 5.44. 
Analysis of the full-length PRO710 sequence shown in Figure 27 (SEQ ID N0:4l) evidences die presence of 
the following: a signal peptide from about amino acid 1 to about amino acid 32, a transmembrane domain from 
about amino acid 454 to about amino acid 476, an aminoacyl-transfer RNA synthetase class-II signature sequence 
from about amino acid 6 to about amino acid 26 and potential N-glycosylation sites from about amino acid 1 1 1 

20 to about amino acid 1 14, from about amino acid 146 to about amino acid 149 and from about amino acid 292 
to about amino acid 295. Clone DNA44161-1434 has been deposited with ATCC on May 27, 1998 and is 
assigned ATCC deposit no. 209907. 

Analysis of the amino acid sequence of the full-length PRO710 polypeptide suggests that it possesses 
significant sequence similarity to the CDC45 protein, thereby indicating that PRO710 may be a novel CDC45 

25 homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
significant homology between the PRO710 amino acid sequence and the following Dayhoff sequences, 
HSAJ3728^1, CEF34D10^1, S64939, UMU50276_1. TRHYJHEEP, CELT14E8_l. RNA1_YEAST. 
LVU89340,1, HSU80736_1 and CEZK337J. 

30 EXAMPLE 15: Isolation of cDNA clones Encoding Human PR01151 

A consensus DNA sequence was assembled relative to odier EST sequences using phrap as described 
in Example I above. This consensus sequence is herein designated DNA40665. Based on the DNA40665 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PGR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 

35 PR01151. 
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PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'-CCAGACGCTGCTCTTCGAAAGGGTC-3' (SEQ ID NO:48) 
reverse KTR primer 5 -GGTCCCCGTAGGCCAGGTCCAGC-3* (SEQ ID NO:49) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40665 
sequence which had the following nucleotide sequence 
5 hybridization probe 

5 -CTAc i ivri CAGCCTCAATGTGCACAGCTGGAATTACAAGGAGACGTA^ (SEQ ID NO:50) 
In order to screen several libraries for a source of a full-lengdi clone. DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 

isolate clones encoding the PRO! 151 gene using the probe oligonucleotide and one of the PCR primers. RNA 
10 for construction of the cDNA libraries was isolated from human fetal kidney tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PROl 151 (designated herein as DNA44694.1500 [Figure 29, SEQ ID NO:461; and the derived protein sequence 

forPR01151. 

The entire nucleotide sequence of DNA44694-15(X) is shown in Figure 29 (SEQ ID NO:46). Clone 
15 DNA44694-1500 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 272-274 and ending at the stop codon at nucleotide positions 1049-1051 (Figure 29). The predicted 
polypeptide precursor is 259 amino acids long (Figure 30). The full-length PROl 151 protein shown in Figure 
30 has an estimated molecular weight of about 28,770 daltons and a pi of about 6. 12. Analysis of the full-length 
PROl 151 sequence shown in Figure 30 (SEQ ID NO:47) evidences the presence of the following: a signal 
20 peptide from about amino acid 1 to about amino acid 20, a potential N-glycosyiation site from about amino acid 
72 to about amino acid 75 and amino acid sequence blocks having homology to Clq domain-containing proteins 
from about amino acid 144 to about amino acid 178, from about amino acid 78 to about amino acid 1 1 1 and from 
about amnx) acid 84 to about amino acid 117. Clone UNQ581 (DNA44694-1500) has been deposited with 
ATCC on August 11, 1998 and is assigned ATCC deposit no. 203114. 
25 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 

alignment analysis of the fiill-length sequence shown in Figure 30 (SEQ ID NO:47), evidenced significant 
homology between the PROl 151 amino acid sequence and the following Dayhoff sequences: ACR3 HUMAN, 
HP25^TAMAS, HUMC1QB2_1, P^R99306, CAIF^HUMAN. JX0369, CA24^HUMAN. S32436, P^R28916 
and CA54_HUMAN. 

30 

EXAMPLE 16: Isolation of cDNA clones Encoding Human PRO 1282 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is designated herein as DNA33778. Based on iheDNA33778 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
35 the sequence of imerest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR01282. 
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PCR primers (forward and reverse) were synthesized: 
forward PGR primer yrCTTCAnrrnrTrnrGCAArrrr'i* (SEQ ID NO:53): and 
reverse PCR primer 5 TTGCTCAr ATrrAnrrrrrr.r Ar^.^' (SEQ id N0:54). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA33778 sequence which had the following nucleotide sequence: 
5 hybridization probe 

5TGGATGTTGTCCAGACAACCAGCTGGAGCrrGTATCCGAGGC3' (SEQ ID NO:55). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above, A positive library was then used to 
isolate clones encoding the PRO 1282 gene using the probe oligonucleotide and one of the PCR primers. RNA 
10 for construction of the cDNA libraries was isolated from human fetal liver. 

DNA sequencing of the clones isolated as described above gave the frill-lengdi DNA sequence for 
PR01282 (designated herein as DNA45495-1550 (Figure 3 1 . SEQ ID N0:5 1 1; and the derived protein sequence 
for PROI282. 

The entire coding sequence of PR01282 is shown in Figure 3 1 (SEQ ID NO:51). Clone DNA45495- 
15 1550 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
120-122, and an apparent stop codon at nucleotide positions 2139-2141 (SEQ ID N0:5I). The predicted 
polypeptide precursor is 673 amino acids long. The signal peptide is at about amino acids 1-23; the 
transmembrane domain is at about amino acids 579-599; an EGF-like domain cysteine panem signature starts 
at about amino acid 430; and leucine zipper patterns start ai about amino acids 197 and 269 of SEQ ID NO:52, 
20 see Figure 32. Clone DNA45495-1550 has been deposited with the ATCC and is assigned ATCC deposit no. 
203156. The fiill-lengih PRO 1282 protein shown in Figure 32 has an estimated molecular weight of about 
71,655 daltons and a pi of about 7.8. 

An analysis of the Dayhoff database (version 35.45 SwissProi 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 32 (SEQ ID NO:52), revealed sequence identity 
25 between the PR01282 amino acid sequence and the following Dayhoff sequences (data from database 
incorporated by reference): AB007876_1, RNPLGPV^I, MUSLRRP^l, ALS^PAPPA, AC004i42.1, 
ALS^HUMAN, AB014462_1, DMTARTAN^l, HSCHON03_1 and S46224. 

EXAMPLE 17: Isolation of cDNA clon es Encoding Human PRQ358 
30 Using the method described in Example 1 above, a single EST sequence was identified in the Incyte 

database, designated herein as INC31 15949. Based on the INC31 15949 EST sequence, oligonucleotides were 

synthesized to identify by PCR a cDNA library that conuined the sequence of interest and for use as probes to 

isolate a clone of the full-length coding sequence for PR0358. 

A pair of PCR primers (forward and reverse) were synthesized: 
35 forward PCR primer 5'-TrrrArrAnnTATrATA a ArrnA a.i- (SEQ ID NO:58) 

reverse PCR primer S ^TTATAOAr A ATrTf^TTrrrATrAr.AnA,^' (SEQ ID NO:59) 
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A probe was also synthesized: 

5'-AAAAAGCATACTTGGAATGGCCCAAGGATAGGTGTAAATG-3* (SEQ ID NO:60) 

In order to screen several libraries for a source of a fulMengih clone. DNA from the libraries was 
screened by PGR amplification widi die PGR primer pair idenUfied above. A positive library was Uxen used to 
isolate clones encoding die PR0358 gene using die probe oligonucleotide and one of die PGR primers. RNA 
for construction of die cDNA libraries was isolated from human bone marrow (LIB256). The cDNA libraries 
used to isolated die cDN A clones were constructed by standard mediods using commercially available reagents 
such as diose from Invitrogen, San Diego. CA. The cDNA was primed widi oligo dT containing a NoU site, 
linked widi blunt to Sail hemikinased adaptors, cleaved widi Noil, sized appropriately by gel electrophoresis, 
and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; pRKSB is a precursor 
of pRK5D diat does not contain die Sfil site; see. Holmes et al. . Science . 253 : 1278-1280 (1991)) in die unique 
Xhol and NotI sites. 

DNA sequencing of die clones isolated as described above gave the fiill-lengdi DNA sequence for 
PR0358 (Figure 33, SEQ ID NO:56) and die derived protein sequence for PR0358 (Figures 34, SEQ ID 
NO:57). 

15 The entire nucleotide sequence of die clone identified (DN A4736 M 1 54) is shown in Figure 33 (SEQ 

ID NO:56). Clone DNA47361.1154 contains a single open reading frame with an apparent iranslational 
initiation site (ATG stan signal) at nucleotide positions underlined in Figure 33. The predicted polypeptide 
precursor is 81 1 amino acids long, including a putative signal sequence (amino acids 1 to 19). an extracellular 
domain (amino acids 20 to 575. including leucine rich repeats in the region from position 55 to position 575), 

20 a putative transmembrane domain (amino acids 576 to 595). Clone DNA47361-I249 has been deposited widi 
ATCC and is assigned ATCC deposit no. 209431. 

EXAMPLE 18: Isolation of cDNA clones Encoding Human PROnm 

A consensus DNA sequence was assembled relative to odier EST sequences using phrap as described 
25 in Example 1 above. This consensus sequence is designated herein as DNA37164. Based on die DNA37164 
consensus sequence, oligonucleoUdes were syndiesized: 1) to identify by PGR a cDNA library diat contained 
die sequence of interest, and 2) for use as probes to isolate a clone of die full-lengdi coding sequence for 
PRO1310. 

PGR primers (forward and reverse) were synthesized: 
30 forward PGR primer: 5'GTTGTCAATGAGCTAGGGGTGGGG3' (SEQ ID NO:63) and 
reverse PGR nrimpr'S*rr.rr,ATr,TAnTnnA A^njnn^r^^Q^^j^y ^j^q jj^ imo:64). 

Additionally, a syndieiic oligonucleotide hybridization probe was constructed fi-om die consensus 
DNA47394 sequence which had die following nucleotide sequence: 
hybridization probe- 

35 5 ATCGGGATAAACGGTGAGTCCTGGTTTGATAATGGGAGCATGTGCATGAG3* (SEQ ID NO:65). 

In order to screen several libraries for a source of a fulMengdi clone. DNA from die libraries was 
screened by PGR amplification widi die PGR primer pair identified above. A positive library was dien used to 
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isolate clones encoding the PRO1310 gene using the probe oligonucleotide and one of die PGR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal liver tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PRO1310 and the derived protein sequence for PR01310. 

The entire coding sequence of PRO1310 is shown in Figures 35A-B (SEQ ID N0:61). Clone 
5 DNA47394-1572 contains a single open reading frame with an apparent translaiional initiation site at nucleotide 
positions 326-328, and an qjpareni stop codon at nucleotide positions 2594-2596 (SEQ ID N0:61). The 
predicted polypeptide precursor is 765 amino acids long. The signal peptide is at about amino acids 1-25 of SEQ 
ID NO:62. Clone DNA47394-1572 has been deposited with ATCC and is assigned ATCC deposit no. 203109. 
The full-length PRO1310 protein shown in Figure 36 has an estimated molecular weight of about 85,898 daltons 
10 and a pi of about 6.87. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 36 (SEQ ID NO:62), revealed sequence identity 
between the PRO1310 amino acid sequence and the following Dayhoff sequences: AF017639_1, P_W36817, 
JC5256, CBPH_HUMAN. MMU23184_1, CBPN^HUMAN. HSU834I1^I. CEF01D4_7, RNU62897^1 and 
15 P_^W11851. 

EXAMPLE 19: Isolation of cDNA Clones Encoding Human PR0698 

A yeast screening assay was employed to identify cDNA clones that encoded potential secreted proteins. 

Use of this yeast screening assay allowed identification of a single cDNA clone whose sequence (herein 
20 designated as DNA39906) is shown in Figure 39 (SEQ ID NO:68). Based on the DNA39906 sequence shown 

in Figure 39, oligonucleotides were synthesized: 1) to identify by PGR a cDNA library that contained the 

sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0698. 

In order to screen several libraries for a full-length clone, DNA from the libraries was screened by PGR 

amplification, as per Ausubel et al.. Current Protocols in Molecular Biology, with the PCR primer pair. A 
25 positive library was then used to isolate clones encoding the gene of interest using the probe oligonucleotide and 

one of the primer pairs. 

PCR primers (forward and reverse) were synthesized: 

forward PCR primer 5'-AGCTGTGGTCATGGTGGTGTGGTG-3* (SEQ ID NO:69) 

reverse PCR primer 5'-CTACCTTGGCCATAGGTGATCCGC-3* (SEQ ID NO:70) 
30 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA39906 

sequence which had the following nucleotide sequence 

hybridization probe 

5'.CATCAGCAAACCGTCTGTGGTTCAGCTCAACTGGAGAGCKiTT-3' (SEQ ID N0:71) 

In order to screen several libraries for a source of a ftill-length clone, DNA from the libraries was 
35 screened by PCR amplification with the PCR primer pair identified above. A positive library was dien used to 
isolate clones encoding the PR0698 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of die cDNA libraries was isolated from human bone marrow tissue (LIB255). The cDNA 
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libraries used ro isolate the cDNA clones were constructed by standard methods using commercially avaUable 
reagents such as those from Invitrogen, San Diego. CA. The cDNA was primed with oligo dT containing a Noil 
site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately by gel 
electrophoresis, and cloned in a defined oricnution imo a suitable cloning vector (such as pRKB or pRKD; 
pRKSB is a precursor of pRKSD that does not contain the Sfil site; see. Holmes et al. . Science . 253: 1278-1280 
5 (1991)) in the unique Xhol and NotI sites. 

A fiill length clone was identified that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 14-16 and ending at the stop codon found at nucleotide positions 
1544-1546 (Figure 37, SEQ ID NO:66). The predicted polypeptide precursor is 510 amino acids long, has a 
calculated molecular weight of approximately 57,280 daltons and an estimated pi of approximately 5.61. 
10 Analysis of the full-length PR0698 sequence shown in Figure 38 (SEQ ID NO:67) evidences the presence of 
the following: a signal peptide from about amino acid 1 to about amino acid 20, potential N-glycosylarion sites 
from about amino acid 72 to about amino acid 75, from about amino acid 136 to about amino acid 139, from 
about amino acid 193 to about amino acid 196, from about amino acid 253 to about amino acid 256. from about 
amino acid 352 to about amino acid 355 and from about amino acid 41 1 to about amino acid 414 an amino acid 
15 block having homology to legume lectin beta-chain proteins from about amino acid 20 to about amino acid 39 
and an amino acid block having homology to the HBGF/FGF family of proteins from about amino acid 338 to 
about amino acid 365. Clone DNA48320-1433 has been deposited with ATCC on May 27, 1998 and is assigned 
ATCC deposit no. 209904. 

Analysis of the amino acid sequence of the full-length PR0698 polypeptide suggests that it possesses 
20 significant sequence similarity to the olfactomedin protein, thereby indicating that PR0698 may be a novel 
olfactomedin homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced significant homology between the PR0698 amino acid sequence and the following Dayhoff sequences. 
OLFM^RANCA, 173637, AB006686S3^1 . RNU78105_1, RNU72487_1, P_R98225, CELC48E7^4. 
CEF11C3_3, XLU85970_1 and S42257. 

25 

EXAMPLE 20: Isolation of c DNA Clones Encoding Human PR073:> 

A yeast screening assay was employed to idemify cDN A clones that encoded potential secreted proteins. 
Use of this yeast screening assay allowed identification of a single cDNA clone whose sequence (herein 
designated as DNA42580) is shown in Figure 45 (SEQ ID NO:77). The DNA42580 sequence was then 

30 compared to a variety of known EST sequences to identity homologies. The EST databases employed included 
public EST databases (e.g.. GenBank) and a proprietary . EST DNA database (UFESEQ^. tacyie 
Pharmaceuticals, Palo Alto, CA). The search was performed using the computer program BLAST or BLAST2 
(Altshul ei al.. Methods in ETmmioiny>y -yfj^df^n^sm (1995)) as a comparison to a 6 frame translation of the EST 
sequence. Those comparisons resulting m a BLAST score of 70 (or in some cases 90) or greater that did not 

35 encode known proteins were clustered and assembled into consensus DNA sequences widi the program "phrap" 
(Phil Green, University of Washington, Seattle, Washington). 
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Using the above analysis, a consensus DNA sequence was assembled relative to other EST sequences 
using phn^. This consensus sequence is herein designated consenOl . Proprietary Genemech EST sequences 
were enqiloyed in the consensus assembly and they are herein designated DNA20239 (Figure 42: SEQ ID 
NO:74), DNA38050 (Figure 43; SEQ ID NO:75) and DNA40683 (Figure 44; SEQ ID NO:76). 

Based on the consenOl sequence, oligonucleotides were synthesized: 1) to identify by PGR a cDNA 
5 library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding 
sequence for PR0732. Forward and reverse PGR primers generally range from 20 to 30 nucleotides and are 
often designed to give a PGR product of about 100- 1(XX) bp in length. The probe sequences are typically 40-55 
bp in length. In some cases, additional oligonucleotides are synthesized when the consensus sequence is greater 
than about l-l.5kbp. In order to screen several libraries for a full-length clone, DNA from the libraries was 

10 screened by PGR amplification, as per Ausubel et ai.. Current Protocols in Molecular Biology, with the PGR 
primer pair. A positive library was then used to isolate clones encoding the gene of interest using the probe 
oligonucleotide and one of the primer pairs. 

PGR primers (forward and reverse) were synthesized: 
forward PGR orimer 5 -ATGTTTGTGTGGAAGTGGCCGG-3' (SEQ ID NO:78) 

15 forward PGR primer 5'-GTCAACATGCTCCTCTGC-3' (SEQ ID NO:79) 

reverse PGR primer 5'>AATCCATTGTGCAGTGCAGCTCTAGG-3' (SEQ ID NO: 80) 

reverse PGR primer 5^<SAGCATGCCAGCACTGGAGTGAC■3' (SEQ ID N0:81) 

Additionally, a synthetic oligonucleotide hybridization probe was consirucied from the consensus DNA44143 

sequence which had the following nucleotide sequence 

20 hybridization probe 

5*-GCGGATGGTGTCGTAGTGGAAAGAACrrGGA(rrGTAAGTAGATTGATCTATGGAC-3' (SEQ ID 
NO:82) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PGR amplification with the PGR primer pairs identified above. A positive library was then used 

25 to isolate clones encoding the PR0732 gene using the probe oligonucleotide and one of the PGR primers. 

RNA for construction of the cDNA libraries was isolated from human fetal lung tissue (LIB26). The 
cDNA libraries used to isolate the cDNA clones were constructed by standard methods using commercially 
available reagents such as those from Invitrogen. San Diego, GA. The cDNA was primed whh oligo dT 
containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately 

30 by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or 
pRKD; pRKSB is a precursor of pRKSD that does not contain the Sfil site; see. Holmes et al.. Science . 
251:1278-1280 (1991)) in the unique Xhol and Nod sites. 

A fiill length clone was identiHed that contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 88-90 and ending at the stop codon found at nucleotide positions 

35 1447-1449 (Figure 40. SEQ ID NO;72). The predicted polypeptide precursor is 453 amino acids long, has a 
calculated molecular weight of approximately 50,419 daltons and an estimated pi of approximately 5.78. 
Analysis of the fuiMength PR0732 sequence shown in Figure 41 (SEQ ID NO:73) evidences the presence of 
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the following: a signal peptide from about amino acid 1 lo about amino acid 28, transmembrane domains from 
about amino acid 37 to about amino acid 57, from about amino acid 93 to about amino acid 109, from about 
amino acid 126 to about amino acid 148. from about amino acid 151 to about amino acid 172, from about amino 
acid 197 to about amino acid 215, from about amino acid 231 to about amino acid 245. from about amino acid 
260 10 about amino acid 279, from about amino acid 315 lo about amino acid 333. from about amino acid 384 
5 to about amino acid 403 and from about amino acid 422 to about amino acid 447, potemial N-glycosylation sites 
from about amino acid 33 to about amino acid 36. from about amino acid 34 to about amino acid 37. from about 
amino acid 179 to about amino acid 183, from about amino acid 298 to about amino acid 301, from about amino 
acid 337 to about amino acid 340 and from about amino acid 406 to about amino acid 409, an amino acid block 
having homology to the MIP family of proteins from about amino acid 1 19 to about amino acid 149 and an 

10 amino acid block having homology to DNAJRNA non-specific endonuclease proteins from about ammo acid 279 
to about amino acid 286. Clone DNA48334-1435 has been deposited with ATCC on June 2. 1998 and is 
assigned ATCC deposit no. 209924. 

Analysis of the amino acid sequence of the full-length PR0732 polypeptide suggests that it possesses 
significant sequence similarity to the Diff33 protein, thereby indicating that PR0732 may be a novel DifG3 

15 homolog. More specificaUy. an analysis of the Dayhoff daubase (version 35.45 SwissProt 35) evidenced 
significant homology between the PR0732 amino acid sequence and the following Dayhoff sequences. 
HS179M20,2. MUSTETU^l. CER11H6_2. RATDRP_1, S51256. E69226. AE000869_1, JC4120. 
CYB^PARTE and P_R50619. 

20 EXAMPLE 21: Isolation of cDNA clones Encoding Human PRO! 120 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

in Example 1 above. This consensus sequence is designated herein consen0352. The consen0352 sequence was 

then extended using repeated cycles of BLAST and phrap to extend the consensus sequence as far as possible 

using the sources of EST sequences discussed above. The extended consensus sequence is designated herein as 
25 DNA34365. Based on the DNA34365 consensus sequence, oligonucleotides were synthesized: 1) to identify 

by PCR a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the 

full-lengdi coding sequence for PROl 120. 

PCR primers (forward and reverse) were synthesized: 

forward PCR primeffi! 5*-GAAGCCGGCTGTCTGAATC-3' (SEQ ID NO:85), 
30 5 .GGCCAGCTATCTCCGCAG-3' (SEQIDNO:86).5*-AAGGGCCTGCAAGAGAAG-3* (SEQ1DN0:87), 

5'-CACTGGGACAACTGTGGG-3* (SEQ ID NO:88), 

5*-CAGAGGCAACGTGGAGAG-3* (SEQ ID NO:89). and 

5*-AAGTATTGTCATACAGTGTTC-3' (SEQ ID NO:90); 

reverse PCR primers: 5'-TAGTACTTGGGCACGAGGTTGGAG-3* (SEQ ID N0:91). and 5'- 
35 TCATACCAACTGCrrGGTCATTGGC-3' (SEQ ID NO:92). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA34365 
consensus sequence which had the following nucleotide sequence: 
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hybridization probe: 

5--CTCAAGCTGCTGGACACGGAGCGGCCC}GTGAATCGGTTTCACTTGO^ (SEQ ID NO:93). 

In order to screen several libraries for a source of a full-length clone. DNA from die libraries was 
screened by PGR amplificauon witii die PGR primer pairs identified above. A positive library was then used 
to isolate clones encoding die PROl 120 gene using die probe oligonucleotide and one of die PGR primers. RNA 
5 for construction of die cDNA libraries was isolated from human fetal kidney tissue. 

DNA sequencing of die clones isolated as described above gave die full-lengdi DNA sequence for 
PROl 120 (designated herein as DNA48606-1479 [Figures 46A-B, SEQ ID NO:83]; and die derived protein 
sequence for PROl 120. 

The entire coding sequence of PROl 120 is shown in Figures 46A-B (SEQ ID NO:83). Clone 
10 DNA48606-1479 contains a single open reading frame widi an apparent translational initiation site at nucleotide 
positions 608-610 and an apparent stop codon at nucleotide positions 3209-321 1 . The predicted polypeptide 
precursor is 867 amino acids long. The fuU-lengdi PROl 120 protein shown in Figure 47 has an estimated 
molecular weight of about 100,156 Dalions and a pi of about 9.44. Additional feamres of die PROl 120 
polypeptide include a signal peptide at about amino acids 1-17; a sulfatase signature at about amino acids 86-98; 
15 regions of homology to sulfatases at about amino acids 87-106, 133-146. 216-229. 291-320, and 365-375; and 
potential N-glycosylation sites at about amino acids 65-68, 112-115, 132-135, 149-152. 171-174. 198-201,241- 
245. 561-564. 608-611. 717-720, 754-757. and 764-767. 

An analysis of die Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of die fiUl-lengdi sequence shown in Figure 47 (SEQ ID NO:84), revealed significant 
20 homology between die PROl 120 amino acid sequence and die following Dayhoff sequences: CELK09C4J, 
GL6S^HUMAN,G65169.NCU89492_l,BCU44852^1,E64903,P_R51355,STS_HUMAN,GA6S^HUMAN, 

and IDS_MOUSE. Clone DNA48606-1479 was deposited with die ATCC on July 1, 1998. and is assigned 
ATCC deposit no. 203040. 

25 EXAMPLE 22: Isolation of cDNA clones Encoding Hu man PR0537 

Use of die signal sequence algoridim described in Example 3 above allowed identification of an EST 
cluster sequence from die Incyie database, designated as Incyie EST cluster no. 29605. This EST cluster 
sequence was dicn compared to a variety of expressed sequence tag (EST) daubases which included public EST 
databases(e.g., GenBank)andaproprietaiy EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, 

30 CA) to identify existing homologies. The homology search was performed usmg die computer program BLAST 
or BLAST2 (Altshul et al.. Methods in Enzvmoloev 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater diat did not encode known proteins were clustered and 
assembled into a consensus DNA sequence widi die program •^phrap" (Phil Green. University of Washington. 
Seattle. Washington). The consensus sequence obtained dierefrom is herein designated DNA48350, 

35 In light of an observed sequence homology between die DNA48350 consensus sequence and an EST 

sequence encompassed widiin die Merck EST clone no. R63443. die Merck EST clone R63443 was purchased 
and die cDNA insert was obtained and sequenced. It was found diat diis insert encoded a full-lengdi protein. 
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The sequence of ihis cDNA imen is shown in Figure 48 and is herein designated as DNA4914I-1431. 

Clone DNA4914 1-1431 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 97-99 and ending at the stop codon at nucleotide positions 442-444 (Figure 48). The 
predicted polypeptide precursor is 1 15 amino acids long (Figure 49). The full-length PR0537 protein shown 
in Figure 49 has an estimated molecular weight of about 13, 183 daltons and a pi of about 12. 13. Analysis of 
5 the full-length PR0537 sequence shown in Figure 49 (SEQ ID NO:95) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 31, a potential N-glycosylation site from about 
amino acid 44 to about amino acid 47» potential N-myristolation sites from about amino acid 3 to about amino 
acid 8 and from about amino acid 16 to about amino acid 21 and an amino acid block having homology to 
multicopper oxidase proteins from about amino acid 97 to about amino acid IC^S. Clone DNA49141-1431 has 

10 been deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203003. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 49 (SEQ ID NO:95), evidenced homology 
between the PR0537 amino acid sequence and the following Dayhoff sequences: A54523, CELF22H10_2, 
FKH4_MOUSE,OTXl_HUMAN,lJRBI_USTMA.KNOB_PIJVFN,A32895_l.AF036332,l,HRG_HUM 

15 and HRP3_PLAFS. 

EXAMPLE 23: Isolation of cDNA clones Encoding Human PRQ536 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated herein as ss.clu2437.inii. This EST cluster sequence was 

20 then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals. Palo Alto. CA) to 
identify existing homologies. The homology search was performed using die computer program BLAST or 
BLAST2 (Altshul ci al.. Methods in Enzvmologv 266:460^80 (19%)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 

25 assembled into a consensus DNA sequence with the program "phrap'* (Phil Green, University of Washington, 
Seatde, Washington). consensus sequence obtained therefrom is herein designated DNA48351. 

In light of an observed sequence homology between the DNA48351 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H 1 1 129. the Merck EST clone HI 1 129 was purchased 
and the cDNA insen was obtained and sequenced. It was found that diis insen encoded a full-length protein. 

30 The sequence of rhis cDNA insen is shown in Figure 50 and is herein designated as DNA49142-1430. 

Clone DNA49I42-1430 contains a single open reading frame widi an apparent translational initiation 
site at nucleotide positions 48-50 and ending at the stop codon at nucleotide positions 987-989 (Figure 50). The 
predicted polypeptide precursor is 313 amino acids long (Figure 51). The full-length PR0536 protein shown 
in Figure 51 has an estimated molecular weight of about 34. 189 daltons and a pi of about 4.8. Analysis of the 

35 fiill-lengih PR0536 sequence shown in Figure 51 (SEQ ID NO:97) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 25, a potential N-glycosylation site from about amino 
acid 45 to about amino acid 48 and an amino acid sequence block having homology to sulfatase proteins from 
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about amino acid 16 to about amino acid 26. Clone DNA49142-1430 has been deposited with ATCC on June 
23» 1998 and is assigned ATCC deposit no. 203002. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 51 (SEQ ID NO:97), evidenced homology 
between the PR0536 amino acid sequence and the following Dayhoff sequences: APU46857 1, PK2 DICDI, 
5 H64743. F5I14_18, CEAM^ECOLI, GEN14267, H64965, TCU39815^1. PSBJ^ODOSI and P R06980. 

EXAMPLE 24 : Isolation of cDNA clones Encoding Human PR0535 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated herein as ss.clu 12694.init. This EST cluster sequence was 

10 then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (UFESEQ* Incyte Pharmaceuticals, Palo Alto. CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al.. Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resultmg in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode Icnown proteins were clustered and 

15 assembled into a consensus DNA sequence with die program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). TTie consensus, sequence obtained therefrom is herein designated DNA48352. Two 
propietary Genentech EST sequences were employed in die assembly arc are herein shown in Figures 54 and 
55, 

In light of an observed sequence homology between the DNA48352 consensus sequence and an EST 
20 sequence encompassed within the Merck EST clone no. H86994, the Merck EST clone H86994 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 52 and is herein designated as DNA49143-i429. 

Clone DNA49 143- 1429 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 78-80 and ending at die stop codon at nucleotide positions 681-683 (Figure 52). The 
25 predicted polypeptide precursor is 201 amino acids long (Figure 53). The full-length PR0535 protein shown 
in Figure 53 has an estimated molecular weight of about 22, 180 daltons and a pl of about 9.68. Analysis of the 
full-lengdi PR0535 sequence shown in Figure 53 (SEQ ID NO:99) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 25, a transmembrane domain from about amino acid 
155 to about amino acid 174, a potential N-glycosylation site from about amino acid 196 to about amino acid 
30 199 and FKBP-typc peptidyl-prolyl cis-irans isomer signanire sequences from about amino acid 62 to about 
amino acid 77, from about amino acid 87 to about amino acid 123 and from about amino acid 128 to about amino 
acid 141. Clone DNA49143-1429 has been deposited with ATCC on June 23, 1998 and is assigned ATCC 
deposit no. 203013. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST- sequence 
35 alignment analysis of the full-length sequence shown in Figure 53 (SEQ ID NO:99), evidenced homology 
between the PR0535 amino acid sequence and the following Dayhoff sequences: S71237, P_R9355I, P R28980. 
S71238, FKB2^HUMAN, CELC05C8^1. S55383. S72485, CELC50F2_6 and S75144. 
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EXAMPLE 25: Isolation of cDNA clones Encoding Human PR0718 

A cDNA sequence isolated in the amylase screen described in Example 2 (human fetal lung library) 
above is herein designated DNA435I2 (see Figure 62; SEQ ID NO: 108). The DNA43512 sequence was then 
compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., 
GcnBank) and a proprietary EST DNA database (LIFESEQ^, Incyie Pharmaceuticals. Palo Alto. CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Alishul et al.. Methods in Enzynioio^ 266:460^80 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into consensus DNA sequences with the program "phrap" (Phil Green, University of Washington, 
ScatUe. Washington). The consensus sequence obtained therefrom is herein designated DNA45625. Proprietary 
Genemech EST sequences were employed in the assembly and are herein shown m Figures 58-61 . 

Based on the DNA45625 sequence, oligonucleotide probes were generated and used to screen a human 
fetal lung library .(UB25) prepared as described in paragraph 1 of Example 2 above. The cloning vector was 
pRK5B (pRKSB is a precursor of pRK5D that does not contain the Sfil site; see. Holmes et al.. Science . 
253:1278-1280 (1991)), and the cDNA size cut was less than 2800 bp. 
15 PCR primers (forward and reverse) were synthesized: 

forward PGR primer 5'-GGGTGGATGGTACTGCTGCATCC-3' (SEQ ID NO: 109) 
reverse PCR primer 5'-TGTTGTGCTGTGGGAAATCAGATGTG-3' (SEQ ID NO: 1 10) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA45625 sequence 
which had the following nucleotide sequence: 
20 hybridization probe 

5*-GTGTCTGGAGGCrGTGGCCGTTTTGrnTCTTGGGCTAAAATCGGG-3' (SEQ ID NO: 11 1) 

In order to screen several libraries for a source of a full-length clone. DNA from the libraries was 
screened by PCR amplification with the PCR primer pair ideniiried above. A positive library was then used to 
isolate clones encoding the PR0718 gene using the probe oligonucleotide and one of the PCR primers. 

25 A full length clone was identified that contained a single open reading frame with an apparent 

translational initiauon site at nucleotide positions 36-38 and ending at the stop codon found at nucleotide positions 
607^ (Figure 56; SEQ ID NO: 102). The predicted polypeptide precursor is 157 amino acids long, has a 
calculated molecular weight of approximately 17,400 daltons and an estimated pi of approximately 5.78. 
Analysis of the full-length PR0718 sequence shown in Figure 57 (SEQ ID NO: 103) evidences the presence of 

30 the following: a type II transmembrane domain from about amino acid 21 to about amino acid 40, and other 
transmembrane domains at about amino acid 58 to about amino acid 78, about amino acid 95 to about amino acid 
1 14. and about amino acid 127 to about amino acid 147; a cell attachment sequence from about amino acid 79 
to about amino acid 81 ; and a poicmial N-glycosylation site from about amino acid 53 to about ammo acid 56. 
Clone DNA49647-1398 has been deposited with ATCC on June 2, 1998 and is assigned ATCC deposit no. 

35 209919. 

Analysis of the amino acid sequence of the full-length PR0718 polypeptide suggests that it possesses 
no significant sequence similarity to any known protein. However, an analysis of the Day hoff database (version 



398 



wo 99/63088 



PCTAJS99/12252 



35.45 SwissProt 35) evidenced some degree of homology between the PR0718 amino acid sequence and the • 
following Dayhoff sequences: AF045606_1. AF039906_1, SPBC8D2J, S63441. F64728, COXl^TRYBB, 
F64375. E64173, RPYGJT_3. MTCY261_23. 

EXAMPLE 26 : Isolation of cDNA clones Encoding Human PR0872 
5 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

Incyie EST sequence designated herein as clul20709.init. The clul20709.init sequence was then compared a 

proprietary EST DNA database (LIFESEQ™. Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 

homologies. The homology search was performed usmg the computer program BLAST or BLAST2 (Altshul 

ei al.. Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
10 in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 

DNA sequence with the program "phrap" (Phil Green, University of Washington. Seatde, Washington). The 

consensus sequence obtained therefrom is herein designated DNA48254. 

In light of an observed sequence homology between the DN A48254 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3438068, the Incyie EST clone 3438068 was purchased 
15 and the cDNA insen was obtained and sequenced. It was found that this insen encoded a full-length protein. 

The sequence of this cDNA insert is shown in Figure 63 and is the full-length DNA sequence for PR0872. 

Clone DNA49819-1439 was deposited with the ATCC on June 2, 1998, and is assigned ATCC deposit no. 

209931. 

The entire nucleotide sequence of DNA49819-1439 is shown in Figure 63 (SEQ ID N0:1 12). Clone 
20 DNA49819-1439 contains a single open reading frame with an apparent iranslational initiation site at nucleotide 
positions 14-16 and ending at the stop codon at nucleotide positions 1844-1846 (Figure 63). The predicted 
polypeptide precursor is 610 amino acids long (Figure 64). The full-length PR0872 protein shown in Figure 
64 has an estimated molecular weight of about 66,820 daiions and a pl of about 8.65. Analysis of the full-length 
PR0872 sequence shown in Figure 64 (SEQ ID NO: 1 13) evidences the presence of the following features: a 
25 signal peptide at amino acid 1 to about 18, putative transmembrane domains at about amino acids 70-87, 200-222 
and 568-588; sequence identity with bacterial-type i^ytoene dehydrogenase protein at about amino acids 7 1 - 105; 
sequence identity with a regulator of chromosome condensation (RCCl) signamre 2 at about amino acids 201- 
211; leucine zipper patterns at about amino acids 214-235, 221^242. 228-249 and 364-385; a potential N- 
glycosylation site at about amino acids 271-274; and a glycosaminoglycan attachment site at about amino acids 
30 75-78. Analysis of the amino acid sequence of the full-length PR0872 polypeptide using die Dayhoff database 
(version 35.45 SwissProt 35) evidenced homology between die PR0872 amino acid sequence and the following 
Dayhoff sequences: PRCRTI^l, S75951, S74689, CELF37C4^3, CRTI_^RHOCA. S76617, YNI2_METTL. 
MTV014_14, AOFB_HUMAN, and MMU70429_L 

35 EXAMPLE 27 : Isolation of cDNA clones Encoding Human PRO1063 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence designated herein as ss.clul 19743.init. The Incyte EST cluster sequence 
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ss.clul 19743.init sequence was then compared to a variety of expressed sequence tag (EST) databases which 
included public EST databases (e.g.. GenBank) and a proprietary EST DNA database (UFESEQ^, Incyie 
Pharmaceuticals, Palo Alto. CA) to identify existing homologies. The homology search was performed using 
the computer program BLAST or BLAST2(Altshul(Btal., Methods in Enzvmolopv 2f^f^'di^s^nQQA}\ Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green. 
University of Washington, Seattle. Washington). The consensus sequence obtained therefrom is herein 
designated DNA48288. 

in light of an observed sequence homology between the DN A48288 consensus sequence and an EST 
sequence encompassed within die Incyte EST clone no. 2783726. the Incyte EST clone 2783726 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a fulMengUi protein. 
The sequence of this cDNA insen is shown m Figure 65 ami is herein designated DNA49820-1427. 

The ftill length clone shown in Figure 65 contained a single open reading frame widi ah apparent 
translational initiation site at nucleotide positions 90-92 and ending at the stop codon found at nucleotide positions 
993-995 (Figure 65; SEQ ID NO:l 14). The predicted polypeptide precursor is 301 amino acids long, has a 
calculated molecular weight of approximately 33,530 daltons and an estimated pi of approximately 4.80. 
Analysis of the fiill-lengih PRO1063 sequence shown in Figure 66 (SEQ ID NO: 1 15) evidences the presence of 
the following: a signal peptide from about amino acid 1 to about amino acid 21. potential N-glycosylaiion sites 
from about amino acid 195 to about amino acid 198. from about amino acid 217 to about amino acid 220 and 
from about amino acid 272 to about amino acid 275, a glycosaminoglycan attachment site from about amino acid 
267 to about amino acid 270, a microbodies C-terminal targeting signal site from about amino acid 299 to about 
amino acid 301, a type II fibroneain collagen-binding domain homology sequence from about amino acid 127 
to about amino acid 168 and a fructose-bisphosphaie aldolase class II protein homology sequence from about 
amino acid 101 to about amino acid 1 18. Clone DNA49820-1427 has been deposited with die ATCC on June 
2. 1998 and is assigned ATCC deposit no. 209932. 

Analysis of the amino acid sequence of the fiill-lengdi PRO 1063 polypeptide suggests that it possesses 
sequence similarity to the human type IV coUagenase protein. More specifically, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some degree of homology between the PRO1063 amino acid 
sequence and die following Dayhoff sequences, S68303, CFU68533_1. P_P91139, RNU65656_1. 
PA2R_RABIT, MMU56734_1, FINC^XENLA. A48925, P^R92778 and FA12^HUMAN. 

PXAMPLE 28 : Isolation of cDNA clones En c oding Human PRQfilQ 

Use of the signal sequence algorithm described m Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated herein as 88434. This EST cluster sequence was then 
compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g.. 
GenBank) and a proprietary EST DNA database (UFESEQ«. Incyie Pharmaceuticals. Palo Alto. CA) to identify 
existing homologies. The homology search was performed using the computer program BLAST or BLAST2 
(Alishul et al.. Methods in Enzymolopy 266:460^80 (1996)). Those comparisons resulting in a BLAST score 
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of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
consensus DNA sequence with the program "phrap" (Phil Green, University of Washington. Seattle, 
Washington). 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Incyte EST clone no. 1656694, the Incyte EST clone 1656694 was purchased and the 
5 cDNA insen was obtained and sequenced. It was found that this insen encoded a full-length protein. The 
sequence of this cDNA insen is shown in Figure 67 and is herein designated as DNA4982 1-1562. 

The full length clone shown in Figure 67 contained a smgle open reading frame with an apparent 
translational initiation site at nucleotide positions 81-83 and ending at the stop codon found at nucleotide positions 
450-452 (Figure 67; SEQ ID NO: 1 16). The predicted polypeptide precursor (Figure 68, SEQ ID NO: 1 17) is 
10 123 amino acids long including a predicted signal peptide at about amino acids 1-20. PR0619 has a calculated 
molecular weight of approximately 13,710 daltons and an estimated pl of approximately 5.19. Clone 
DNA49821-1562 was deposited with the ATCC on June 16, 1998 and is assigned ATCC deposit no. 209981. 

An analysis of the Dayhoff database (version 35.45 SwissProi 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 68 (SEQ ID NO: 117), revealed significant 
15 homology between the PR0619 amino acid sequence and the following Dayhoff sequences: S35302, D87009 1, 
HSU93494_1, HUMIGLAM5^1. D86999^2, HUMIGLYMI^K HUM1GLYMKE_1, A29491_^l, A29498_l, 
and VPR2^M0USE. 

EXAMPLE 29: Isolation of c DNA clones Encoding Human PR0943 

20 A consensus DNA sequence encoding PR0943 was assembled relative to other EST sequences using 

lArap as described in Example 1 above. This consensus sequence was then extended using repeated cycles of 
BLAST and phrap to extend the consensus sequence as far as possible using the sources of EST sequences 
discussed above. The extended consensus sequence is herein designated DNA36360. Based on the DNA36360 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 

25 the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR0943. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5'-CGAGATGACGCCGAGCCCCC-3 ' (SEQ ID NO: 120) 
reverse PCR primer 5*-CGGTTCGACACGCGGCAGGTG-3* (SEQ ID NO: 121) 
30 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA36360 
sequence which had the following nucleotide sequence 
hybridization pmhe 

5'-TGCrGCTCCTGCTGCCGCCGCrrGCrrGCTGGGGGCCTTCCCGCC(jG-3' (SEQ ID NO:122) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
35 screened by PCR amplification with die PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0943 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal brain tissue. 
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DNA sequencing of the clones isolated as described above gave the ftill-Iengib DNA sequence for 
PR0943 (designated herein as DNA52 192-1369 [Figure 69. SEQ ID NO: 1 18]) and the derived protein sequence 
for PR0943. 

The entire nucleotide sequence of DNA52192.1369 is shown in Figure 69 (SEQ ID NO: 1 18). Clone 
DNA52192-1369 contains a single open reading frame with an apparent translational initiation site at nucleotide 
5 positions 150-152 and ending at the aop codon at nucleotide positions 1662-1664 (Figure 69). The predicted 
polypeptide precursor is 504 amino acids long (Figure 70). The full-length PR0943 protein shown in Figure 
70 has an estimated molecular weight of about 54,537 daltons and a pi of about 10.04. Analysis of the full- 
length PR0943 sequence shown in Figure 70 (SEQ ID NO: 1 19) evidences the presence of the following: a signal 
peptide from about amino acid 1 to about amino acid 17, a transmembrane domain from about amino acid 376 

10 to about amino acid 396. tyrosine kinase phosplwrylation sites from about amino acid 212 to about amino acid 
219 and from about amino acid 329 to about amino acid 336. potemial N-glycosylation sites from about amino 
acid 1 1 1 to about amino acid 1 14, from about amino acid 23 1 to about amino acid 234. from about amino acid 
255 to about amino acid 258 and from about amino acid 293 to about amino acid 296 and an immunoglobulin 
and MHC protein sequence homology block from about amino acid 219 to about amino acid 236. Clone 

15 DNA52192-1369 has been deposited with ATCC on July 1, 1998 and is" assigned ATCC deposit no. 203042. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 70 (SEQ ID NO: 119), evidenced significant 
homology between the PR0943 amino acid sequence and the following Dayhoff sequences: B49I51. A39752. 
FGR1_XENLA, S38579, RATHBFGFRB_1, TVHU2F. FGR2_M0USE, CEK3_CHICK, P_R21080 and 

20 A27171J. 

EXAMPLE 30: Isolation of cDNA clones Encoding Human PRO 1 1R« 

A consensus DNA sequence was assembled relative to other EST sequences using the program -phrap" 
as described in Example 1 above. This consensus sequence is designated herein as DNA45679. Based on the 
25 DNA45679 consensus sequence, oligonucleotides were synthesized: 1) to idemify by PGR a cDNA library that 
contained the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence 
forPROI188. 

PGR primers (forward and reverse) were synthesized: 
forward PGR primer 5*-CTGGTGCGTCAACAGGGAGCAG-3* (SEQ ID NO: 125) 
30 reverse PGR primer 5'.GGATTGTGGAGGTCAGGTGAGAG-3* (SEQ ID NO:126) 

AdditionaUy, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DNA45679 sequence which had the following nucleotide sequence: 
hybridization probe 

5 -GTGGAGGAAGTGGTGAGGTGGGTGTGGTGAGAGTGGGGTG.3* (SEQ ID NO: 127) 
35 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PGR amplification with the PGR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROl 188 gene using the probe oligonucleotide and one of the PGR primers. RNA 
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for construction of the cDNA libraries was isolated from hiunan fetal kidney tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PROI188 (designated herein as DNA52S98-1518 (Figure 71, SEQ ID NO:123]); and the derived protein 
sequence for PRO 1188. 

The entire coding sequence of PROI 188 is shown in Figure 71 (SEQ ID NO: 123). Clone DNA52598- 
5 1518 contains a single open reading frame with an apparent translaiional initiation site at nucleotide positions 
136-138 and an apparent stop codon at nucleotide positions 3688-3690. The predicted polypeptide precursor 
is 1184 amino acids long. The full-length PROI 188 protein shown in Figure 72 has an estimated molecular 
weight of about 132,582 Daltons and a pi of about 8.80. Additional features include: a signal peptide at about 
amino acids 1-31; an ATP/GTP binding site motif A (P-loop) at about amino acids 266-273; an aldehyde 
10 dehydrogenases cysteine active site at about amino acids 188-199; growth factor and cytokines receptors family 
signanire 2 at about amino acids 153-159; and potential N-glycosylation sites at about amino acids 129-132, 132- 
135, 346-349. 420-423. 550-553, 631-634, 1000-1003. and 1056-1059. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 72 (SEQ ID NO: 124), revealed significant 
15 homology between the PROI 188 amino acid sequence and the foUowinig Dayhoff sequences: SSU83114_1. 
S56015, CET2 186^4, CELT19D2^1. and TSPl^MOUSE. 

Clone DNA52598-1518 has been deposited with ATCC and is assigned ATCC deposit no 203107. 

EXAMPLE 31 : Isolation of cDNA clones Encoding Human PRO! 133 

20 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

in Example 1 above. This sequence was extended using repeated cycles of phrap. The extended consensus 
sequence is designated herein DNA38102. Based on the DNA38102 consensus sequence, oligonucleotides were 
synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use as 
probes to isolate a clone of the full-length codmg sequence for PROI 133. 

25 PCR primers (two forward and one reverse) were synthesized: 

forward PCR primer 1 5'.TCGATTATGGACGAACATGGCAGC-^' (SEQ ID NO: 130); 
forward PCR primer 2 S'-TTCTGAGATCCCTCATCCTC-3' (SEQ ID NO: 131); and 
reverse primer 5'-AGGTTCAGGGACAGCAAGTTTGGG-3' (SEQ ID NO: 132). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 

30 DNA38I02 sequence which had the following nucleotide sequence: 
hybridization probe 

5*TTTGCTCKjACCTCGGCTACGGAATTGGCTTCCCTCrrACGGACAGCrrGGAT3* (SEQ ID NO: 133). 

In order to scr^n several libraries for a source of a fiill-lengih clone, DNA from the libraries was 
screened by PCR amplification with a PCR primer pair identified above. A positive library was then used to 
35 isolate clones encoding the PROI 133 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney tissue. 
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DNA sequencing of the clones isolated as described above gave the ftiU-Iength DNA sequence for 
PROl 133 and (be derived protein sequence for PROl 133. 

The entire coding sequence of PROl 133 is shown in Figure 73 (SEQ ID NO: 128). Clone DNA53913- 
1490 contains a single open reading frame with an apparent translational iniuaiion site at nucleotide positions 
266-268 and an ^n»nm stop codon at nucleotide positions 1580-1582 of SEQ ID NO:128. The predicted 
polypeptide precursor is 438 amino acids long. The signal peptide is at amino acids 1-18 of SEQ ID NO: 129. 
EGF-like domain cysteme panera signatures stan at 315 and 385 of SEQ ID NO: 129 as shown in Figure 74. 
Clone DNA53913-1490 has been deposited with ATCC and is assigned ATCC deposit no. 203162. The full- 
length PROl 133 protein shown in Figure 74 has an estimated molecular weight of about 49.260 daltons and a 
pi of about 6.15. 

An analysis of the Dayhoff database (version 35.45 SwissPtot 35). using a WU-BLAST-2 sequence 
alignment analysis of the fiill-length sequence shown in Figure 74 (SEQ ID NO: 129). revealed some sequence 
identity between die PROl 133 amino acid sequence and the following Dayhoff sequences (data from the database 
incorporated herein): AF002717_1. LMG1_HUMAN. B54665. UNC6_CAEEL. LML1_CAEEL. 
LMA5_M0USE, MMU88353_1. LMA1_HUMAN. HSLN2C64_1 and AFQ05258_1 . 



EXAMPLE 32: Isolation of cDNA clones Encoding Human PR0784 

An initial DNA sequence (SEQ ID N0:136), referred to herein as DNA44661 and shown in Figure 77. 
was identified using a yeast screen, in a human fetal lung cDNA library that preferentially represents the 5* ends 
of the primary cDNA clones. DNA44661 was then compared to ESTs from public databases (e.g.. GenBank), 
and a proprietary EST database (LIFESEQ*. Incyte Pharmaceuticals. Palo Alto. CA). using the computer 
program BLAST or BLAST2 fAltschul et al. . Methods in FnTvmnlnpY 266:460^80 ( 1 996)1 • The ESTs were 
then clustered and assembled imo a consensus DNA sequence using the computer program "phrap" (Phil Green. 
University of Washington. Seattle. Washington). The consensus sequence obtained is designated herein as 
"DNA45463". Based on the DNA45463 consensus sequence, oligonucleotides were synthesized for use as 
probes to isolate a clone of the fiill-length coding sequence for PR0784 from a human fetal lung cDNA library. 

The full length DNA53978-1443 clone shown in Figure 75 contained a single open reading frame witfi 
an appaiem translational initiation site at nucleotide positions 37-39 and ending at the stop codon found at 
nucleoUde positions 821-823 (Figure 75; SEQ ID NO:134). The predicted polypeptide precursor (Figure 76. 
SEQ ID NO: 135) is 228 amino acids long. PR0784 has a calculated molecular weight of approximately 25.735 
Daltons and an estimated pi of approximately 5.45. PR0784 has the followmgfeanires: a signal peptide at about 
amino acid 1 to about 15; transmembrane domains at about amino acids 68 to about 87 and at about 183 to about 
204; potential N-myristoylation sites at about amino acids 15-20. 51-56. 66-60. 163-168. and 206-211; and an 
RNP-1 protein RNA-binding region at about amino acids 108 to about 117. 

Clone DNA53978-1443 was deposited with ATCC on June 16. 1998. and is assigned ATCC deposit 
35 no. 209983. 

Based on a BLAST and FastA sequence alignment analysis (using the ALIGN computer program) of 
the fiill-length sequence. PR0784 shows amino acid sequence identity to the following proteins: RNU42209 1. 
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MMU91538_1. CGU91742^1. CELF55A4^6. SC22^YEAST. and F48188. 

EXAMPLE 33: Isolation of cDNA Clones Encoding Human PR0783 

A yeast serening assay was employed lo identify cDN A clones that encoded potential secreted proteins. 
Use of this yeast screening assay allowed identification of a single cDNA clone, designated herein as DNA45201 
5 (Figure 80; SEQ ID NO: 139). 

The DNA45201 sequence was then used to search expressed sequence tag (EST) databases for the 
presence of potential homologies. The EST databases included public EST databases (e.g.. GenBank) and a 
proprietary EST DNA database (LIFESEQ^. Incyte Pharmaceuticals, Palo Alto. CA). The search was 
performed using the computer program BLAST or BLAST2 (Aitshul ei al, . Methods in Enzymoinpy ^f^'^f^ 
10 480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not 
encode known proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" 
(PhU Green, Univ. of Washington. SeatUe, Washington). The consensus sequence obtained is herein designated 
as -consenOr. A proprietary Genentech EST sequence was used in the consensus assembly and is herein 
designated as DNA14575 (Figure 81; SEQ ID NO:140). 

15 Based on the consenOl sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA 

library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the fulMength coding 
sequence for PR0783. In order to screen several libraries for a full-length clone. DNA from die libraries was 
screened by PCR amplification, as per Ausubel et aL, Current Protocols in Molecular Biolo^. wiUi the PCR 
primer pair. A positive library was then used to isolate clones encoding the gene of interest using the probe 

20 oligonucleotide and one of the primer pairs. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer S .r, A PTHT a rrrr, a r,r-rri- An a r-x^^r- y (S£q nO:141), 
forward PCR primer 5*-TCAGCAATGAGGTGCTGCrrC-3' (SEQ ID NO: 142), and 
npverse pCR primer S'-TGAGGAAGATGAGGGACAGGTTGG S' (SEQ ID NO: 143). 

25 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consenOl 

sequence which had the following nucleotide sequence: 
hybridization prnhe 

5'-TATGGAAGCACCTGACTACGAAGTGCTATCCGTGCGAGAACAGCTATTCC-3' (SEQ ID NO:144). 
In order to screen several libraries for a source of a full-length clone. DNA from the libraries was 

30 screened by PCR amplification with a PCR primer pair identified above. A positive library was dien used to 
isolate clones encoding die PR0783 gene using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libraries was isolated from human fetal kidney tissue (LIB228). 
The cDNA libraries used to isolate the cDNA clones were constructed by standard methods using commercially 
available reagents such as those from Invitrogen, San Diego. CA. The cDNA was primed widi oligo dT 

35 containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized appropriately 
by gel electrophoresis, and cloned in a defmed orientation into a suitable cloning vector (such as pRKB or 
pRKD; pRKSB is a precursor of pRK5D that does not contain the Sfil site; see. Holmes ei al.. Science . 
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252: 1278-1280 (1991)) in the unique Xhol and NotI sites. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0783 [herein designated as DNA53996-1442] (SEQ ID NO: 137) and the derived protein sequence for 
PR0783. 

The entire nucleotide sequence of DNA53996-1442 is shown in Figure 78 (SEQ ID NO: 137). Clone 
5 DNA53996-1442 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 310-312 and ending at the stop codon at nucleotide positions 1777-1779 (Figure 78). The predicted 
polypeptide precursor is 489 amino acids long (Figure 79). The full-length PR0783 protein shown in Figure 
79 has an estimated molecular weight of about 55.219 daltons and a pi of about 8.47. Analysis of the full-length 
PR0783 sequence shown in Figure 79 (SEQ ID NO: 138) evidences the presence of the following features: 
10 transmembrane domains located at about amino acids 23-42, 67-89. 111-135, 154-176, 194-218. 296-319, 
348-370, 387-410 and 427-452; leucine zipper patterns located at about amino acids 263-283 and 399-420; a 
potential tyrosine kinase phosphorylation site at about amino acids 180-187; potential N-glycosylation sites at 
about amino acids 105 -108 and 121-124; potential cAMP- and a cGMP-dependent protein kinase 
phosphorylation site at about amino acids 288-291; and a region having sequence identity with bacterial 
IS rhodopsins retinal binding site protein at about amino acids 190-218. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35) shows some sequence identity 
between the PR0783 amino acid sequence and the following Dayhoff sequences: YNC2_CAEEL, D64048, 
ATACX)02332JF4P9.3, NY2R^SHEEP, and VSH^MUMPA. 

Clone DNA53996-1442 was deposited with the ATCC on Jime 2. 1998, and is assigned ATCC deposh 
20 no. 209921. 

EXAMPLE 34 : Isolation of cDNA Clones Encoding Human PRO820 

An expressed sequence tag (EST) DNA database (Merck/Wash. U) was searched and an EST designated 
EST no. AA504080, Merck clone 825136, was identified (library 3 12, human B-cell tonsil). Homology searches 

25 revealed that this EST showed sequence identity with low affmity immunoglobulin gamma Fc receptor II. DNA 
sequencing gave the full-length DNA sequence for PRO820 and the derived protein sequence for PRO820. 

The emire nucleotide sequence of DNA56041-1416 is shown in Figure 82 (SEQ ID NO: 145). Clone 
DNA56041-1416 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 115-117 and ending at the stop codon at nucleotide positions 487-489 (Figure 82). The predicted 

30 polypeptide precursor is 124 amino acids long (Figure 83). The full-length PRO820 protein shown m Figure 
83 has an estimated molecular weight of about 14.080 daltons and a pl of about 7.48. Clone DNA56041-1416 
has been deposited with ATCC. Regarding the sequence, it is understood that the deposited clone contains the 
correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Still analyzing the amino acid sequence of SEQ ID NO: 146. the putative signal peptide is at about amino 

35 acids 1-15 of SEQ ID NO:146. Protein kinase C phosphorylation sites are at about amino acids 20-22 and 43-45 
of SEQ ID NO: 146. An N-myristoylation site is at about amino acids 89-94 of SEQ ID NO: 146. An 
Immunoglobulin and major histocompatibility complex domain is at about amino acids 83-90 of SEQ ID NO: 146. 
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The corresponding nucleotides can be routinely determined given the sequences provided herein. 

EXAMPLE 35: Isclaiion of cDNA CAone^ Encoding Human PRO108Q 

A consensus DNA sequence was assembled relative to other EST sequences using phrap and was 
extended using repeated cycles of BLAST and phrap so as to extend the consensus sequence as far as possible 
using the sources of the EST sequences as descrited in Example 1 above. The consensus sequence is designated 
herein as DNA52640. An EST proprietary to Genentech was employed in the consensus assembly and is herein 
designated as DNA36527 (Figure 86; SEQ ID NO: 149). 

In light of an observed sequence homology between the DNA36527 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 526423. the Merck EST clone 526423 was purchased 
and the cDNA insert was obtained and sequenced: It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 84 and is herein designated as DNA56047-1456. 

The entire nucleoUde sequence of DNA56047-1456 is shown in Figure 84 (SEQ ID NO: 147), Clone 
DNA56047-1456 contains a single open reading frame with an apparent translaiional initiation site at nucleotide 
positions 159-161 and endmg at the stop codon at nucleotide positions 1233-1235 of SEQ ID NO:147 (Figure 
84). The predicted polypeptide precursor is 358 amino acids long (Figure 85). The full-length PRO1080 protein 
shown in Figure 85 has an estimated molecular weight of about 40,514 dalions and a pi of about 6.08. Clone 
DNA56047-1456 has been deposited with ATCC on June 9. 1998. It is understood that the deposited clone has 
the acmal nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

Also shown in Figure 85 are the approximate locations of the signal peptide, cell attachment site, Nt- 
DnaJ domain signature, region having sequence identity with Nt-DnaJ domain proteins, and N-glycosylation 
sites. The corresponding nucleic acids of these amino acid sequences and oUiers provided herein can be 
routinely determined by the information provided herein. 

EXAMPLE 36! Isolation of cDNA Clones Encoding Human PRO 1079 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above, and is herein designated DNA52714. Based on information provided by the assembly, the 
clone for Merck EST no. H06898 was obtained and sequenced, thereby giving the nucleotide sequence 
designated herein as DNA56050-1455. The entire nucleotide sequence of DNA56050-1455 is shown in Figure 
87 (SEQ ID NO: 150). Clone DNA56050-1455 contains a single open reading frame with an apparent 
translational initiation site at nucleotide positions 183-185 and ending at die stop codon at nucleotide positions 
861-863 (Figure 87). The predicted polypeptide precursor is 226 amino acids long (Figure 88). The full-length 
PRO1079 protein shown m Figure 88 has an estimated molecular weight of about 24,61 1 Daltons and a pi of 
about 4.85. Analysis of the full-length PRO1079 sequence shown in Figure 88 (SEQ ID N0:3) evidences the 
presence of the following feanires: a signal peptide at about amino acid 1-29; potential N-myristoylation sites 
at about amino acids 10-15, and 51-56; homology to photosysiem I psaG and psaK proteins at about amino acids 
2 to 20; and homology to prolyl endopepiidase family serine proteins at about amino acids 150 to 163. 
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Analysts of the amino acid sequence of the full-length PRO 1079 polypeptide using the Dayhoff database 
(version 35.45 SwissProt 35) evidenced some sequence identity between the PRO1079 amino acid sequence and 
the following Dayhoff sequences: CEK10C3_^4, MMU50734^1, D69503, AF051 149^1, and VSMP^CVMS. 

Clone UNQ536 (DNA56050-1455) was deposited with the ATCC on June 22, 1998. and is assigned 
ATCC deposit no. 20301 1. 

5 

EXAMPLE 37: Isolation of cDNA clones Encoding Human PR0793 

A cDNA clone (DNA56 110-1437) encoding a native human PR0793 polypeptide was identified by a 
yeast screen, in a human skin tumor cDNA library that preferemially represents the 5* ends of the primary 
cDNA clones. The yeast screen employed identified a single EST clone designated herein as DNA50177 (Figure 

10 91; SEQ ID NO: 154). The DNA50177 sequence was then compared to various EST databases including public 
EST databases (e.g. , GenBank), and a proprietary EST database (LIFESEQ^, Incyte niarmaceuticals. Palo Alto, 
CA) to identify homologous EST sequences. The con^>arison was performed using the computer program 
BLAST or BLAST2 [Altschul et al.. Methods in Enzvmolojgv. 266:460-480 (1996)]. Those con^wrisons 
resulting in a BLAST score of 70 (or in some cases, 90) or greater that did not encode known proteins were 

IS clustered and assembled into a consensus DNA sequence with the program ^phrap" (Phil Green, University of 
Washington, Seattle, Washington). This consensus sequence is herein designated DNA50972. 

In light of an observed sequence homology between the DNA50972 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no, N33874, the Merck EST clone N33874 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insen encoded a lull-length protein. 

20 The sequence of this cDNA insert is shown in Figure 89 and is herein designated as DNA561 10-1437. 

The full-length DNA561 10-1437 clone shown in Figure 89 contains a single open reading frame with 
an apparent translational initiation site at nucleotide positions 77-79 and ending at the stop codon at nucleotide 
positions 491-493 (Figure 89). The predicted polypeptide precursor is 138 amino acids long (Figure 90). The 
full-length PR0793 protein shown in Figure 90 has an estimated molecular weight of about 15,426 daltons and 

25 a pi of about 10.67. Analysis of the full-length PR0793 sequence shown in Figure 90 (SEQ ID NO: 153) 
evidences the presence of the following: transmembrane domains from about amino acid 12 to about amino acid 
30, from about amino acid 33 to about amino acid 52, from about amino acid 69 to about amino acid 89 and 
from about amino acid 93 to about amino acid 109, potential N-myristolation sites from about amino acid 1 1 to 
aboiu amino acid 16, from about amino acid 51 to about amino acid 56 and from about amino acid 1 16 to about 

30 amino acid 121 and an amino acid sequence block having homology to an aminoacyi-transfer RNA synthetase 
class-II protein from about amino acid 49 to about amino acid 59. Clone DNA561 10-1437 has been deposited 
with ATCC on August 1 1 , 1998 and is assigned ATCC deposit no. 203 113. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 90 (SEQ ID NO: 153), evidenced cenain 

35 homology between the PR0793 amino acid seqtience and the following Dayhoff sequences: S47453, 
AF015193_12, MTEHGNS9J, E64030, H69784. D64995. CD53^MOUSE, GEN8006, AH)01I38_7 and 
C0X2_STRPU. 
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EXAMPLE 38: Isolation of cDNA Clones Encoding Human PRO 1016 

A consensus DNA sequence was assembled relative to other EST sequences using pbrap as described 
in Example I above. The consensus sequence obtained is herein designated DNA53502. 

In light of an observed sequence homology between the DNA53502 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 38680, the Merck EST clone 38680 was purchased and 
S the cDNA insen was obtained and sequenced. It was found that this insen encoded a full-length protein. The 
sequence of this cDNA insen is shown in Figure 92. 

The entire nucleotide sequence of DNA561 13-1378 is shown in Figure 92 (SEQ ID NO: 155). Clone 
DNA56I13-1378 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 168-170 and ending at the stop codon at nucleotide positions 1302-1304 (Figure 92). The predicted 
10 polypeptide precursor is 378 amino acids long (Figure 93). The full-length PR01016 protein shown in Figure 
93 has an estimated molecular weight of about 44,021 daltons and a pi of about 9.07. Clone DNA561 13-1378 
has been deposited widi the ATCC. Regarding the sequence, it is understood that the deposited clone contams 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PRO 1016 polypeptide suggests that ponions of 
IS it possess sequence identity with acyltransferase, thereby indicating that PRO 10 16 may be a novel 
acyltransferase. 

Stiil analyzing the amino acid sequence of SEQ ID NO: 156, the putative signal peptide is at about amino 
acids 1-18 of SEQ ID NO: 156. The transmembrane domain(s) are at about amino acids 332-352 and 305-330 
of SEQ ID NO: 156. The fructose-bisphosphate aldolase class-II protein homology sequence is at about amino 
20 acids 73-90 of SEQ ID NO: 156. The extradiol ring-cleavage dioxygenase protein is at about amino acids 252- 
275 of SEQ ID NO: 156. The corresponding nucleotides can be routinely determined given the sequences 
provided herein. 

The specific Dayhoff database designation names of sequences to which PRO1016 has sequence identity 
with include the following: S52645, P^R59712, P_R99249. P_R59713. BNAGPATRF_1, CELT05H4_15 and 
25 CEL2K40_1. 

E?CAMPLE 39:Isolatioyi of cDNA Encoding Human PROlOp 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. The consensus DNA sequence was then extended using repeated cycles of BLAST and 

30 phrap to extend the consensus sequence as far as possible using the sources of EST sequences. 

In light of an observed sequence homology between die consensus sequence and an EST sequence 
encompassed within the Incyte EST clone no. 3107695. the Incyte EST clone 3107695 was purchased and the 
cDNA insen was obtained and sequenced. It was found that this insen encoded a full-length protein. The 
sequence of this cDNA insen is shown in Figure 94 and is herein designated as DNA564 10-1414. 

35 The entire nucleotide sequence of DNA56410-1414 is shown in Figure 94 (SEQ ID NO: 157). Clone 

DNA56410-1414 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 17-19 and ending at the stop codon at nucleotide positions 1244-1246 (Figure 94). The predicted 
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polypeptide precursor is 409 amino acids long (Figure 95), The full-length PRO1013 protein shown in Figure 
95 has an estimated molecular weight of about 46»662 daltons and a pi of about 7.18. Clone DNA56410-1414 
has been deposit^ with the ATCC, Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Stili analyzing the amino acid sequence of SEQ ID NO: 158, the putative signal peptide is at about amino 
5 acids 1-19 of SEQ ID NO: 158. N-glycosylation sites are at about amino acids 75-78 and 322-325 of SEQ ID 
NO: 158. An N-myristqylation site is at about amino acids 184-189 of SEQ ID NO: 158. A growth factor and 
cytokine receptor family domain is at about amino acids 134-149 of SEQ ID NO: 158. The corresponding 
nucleotides can be routinely determined given the sequences provided herein. 

Blast analysis showed some sequence identity with other proteins. Specifically, PRO1013 has some 
10 sequence identity with at least the Dayhoff sequences designated: D63877 1 ; MHU22019_1 , AE000730 10, and 
AF019079_1. 

EXAMPLE 40: Isolation of cDNA Clones Encoding Human PR0937 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
IS in Example 1 above. That consensus sequence is herein designated DNA49651. Based on the DNA4965I 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding seqiience for 
PR0937. 

PCR primers (forward and reverse) were synthesized: 
20 forward PCR primer S'-CTCCGTCGTAAACCCCACAGCrr.^' (SEQ ID NO: 161); and 
reverse PCR primer S'-TCACATrnATnGGATCrATGArrn-';' (SEQ ID NO: 162). 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA48651 sequence 
which had the following nucleotide sequence: 
hvbridization probe 

25 5*-GGTCTCGTGACTGTGAAGCCATGTTACAACTACTGCTCAAACATCATGAG-3* (SEQ ID NO: 163). 

In order to screen several libraries for a source of a full-length clone, DNA torn the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0937 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney tissue (LIB227). 

30 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PR0937 [herein designated as DNA56436-1448] (SEQ ID NO: 159) and the derived protein sequence for 
PR0937. 

The entire nucleotide sequence of DNA56436-1448 is shown in Figure 96 (SEQ ID NO: 159). It 
contains a single open reading frame having an apparent translaiional initiation site at nucleotide positions 499- 
35 501 and ending at the stop codon found at nucleotide positions 2167-2169 (Figure 96, SEQ ID NO: 159). The 
predicted polypeptide precursor is 556 amino acids long, has a calculated molecular weight of approximately 
62,412 daltons and an estimated pi of approximately 6.62. Analysis of the full-length PR0937 sequence shown 
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in Figure 97 (SEQ ID NO: 160) evidences the presence of the following features: signal peptide at about amino 
acids 1-22; ATP/GTP-binding site motif A (P-loop) at about amino acids 515-523; a potential N-glycosylaiion 
site at about amino acids 514-517; and sites of glypican homology at about amino acids 54-74, 106-156, 238- 
279. 309-345, 423^59, and 468-505. 

Clone DNA56436-1448 has been deposited with ATCC on May 27. 1998, and is assigned ATCC 
5 deposirno. 209902. 

Analysis of the amino acid sequence of the full-length PR0937 polypeptide suggests that it possesses 
significant sequence similarity to glypican proteins, thereby indicating that PR0937 may be a novel glypican 
protein. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
significant homology between the PR0937 amino acid sequence and the following Dayhoff sequences: 
10 GPCK_MOUSE, GPC2^RAT, GPC5_HUMAN, GPC3^HUMAN, P^R30168, CEa)3H12_.2. GEN13820, 
HS119E23_1, HDAC_DROME. and AF017637_1. 

EXAMPLE 41: Isolation of cDNA clones Encoding Human PR0842 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

15 Incyte EST cluster sequence designated herein as Incyte EST cluster sequence no. 69572. This EST cluster 
sequence was then compared to a variety of expressed sequence tag (EST) databases which included public EST 
databases (e.g. . GenBank) and a proprietary EST DNA database (LIFESEQ®. Incyte Pharmaceuticals, Palo Alto, 
CA) to identify existing homologies. The homology search was performed using the computer program BLAST 
or BLAST2 (Altshul et al., Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a 

20 BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA54230. 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Merck EST clone no. AA477092, the Merck EST clone AA477092 was purchased and 

25 the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 98 and is herein designated as DNA56855-1447. 

The full length clone shown in Figure 98 contained a smgle open reading frame with an apparent 
translational initiation site at nucleotide positions 153-155 and ending at the stop codon found at nucleotide 
positions 510-512 (Figure 98; SEQ ID NO: 164). The predicted polypeptide precursor (Figure 99, SEQ ID 

30 NO: 165) is 119 amino acids long. PR0842 has a calculated molecular weight of approximately 13,819Dalions 
aittl an estimated pi of awroximately 11.16. Other features of PR0842 include a signal peptide at about amino 
acids 1-22, a potential protein kinase C phosphorylation site at about amino acids 39-41 and two potential N- 
myristoylation sites at about amino acids 27-32 and about amino acids 46-51. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 

35 alignmem analysis of the full-length sequence shown in Figure 98 (SEQ ID NO: 164), evidenced some homology 
between the PR0842 amino acid sequence and the following Dayhoff sequences: CEZK13l_l 1. P_R80843, 
RAT5HT2X_1, S81882^l. A60912, MCU60315.137MC137L, U93422.1, p^P91996, U93462_l, and 
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ZN18_^HUMAN. 

Clone DNA56855-1447 was deposited with the ATCC on Jime 23, 1998, and is assigned ATCC deposit 
no. 203004. 

EXAMPLE 42 : Isolation of cDN A clones Encoding Human PR0839 
5 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte UFESEQ^^ database, designated Incyte EST Cluster No. 24479. This EST 
cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included 
public EST databases (e.g.. CienBank) and a proprietary EST DNA database (LIFESEQ®. Incyte 
Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using 
10 the computer program BLAST or BLAST2(Altshuletal., Methods in Enzvmologv 266:460-480 (\996^\. Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, 
University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is herein 
designated DNA55709. 

15 In light of an observed sequence homology between the DNA55709 consensus sequence and an EST 

sequence encompassed within the Merck EST clone no. 754525, the Merck EST clone 754525 was purchased 
and the cDNA insen was obtained and sequenced. It was found that this insen encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 100 and is herein designated as DNA56859-1445. 

The fiill length clone shown in Figure 100 contained a single open reading frame with an apparent 

20 iranslational initiation site at nucleotide positions 2-4 and ending at the stop codon found at nucleotide positions 
263-265 (Figure 100; SEQ ID NO: 166). The predicted polypeptide precursor (Figure 101. SEQ ID NO: 167) 
is 87 amino acids long. PR0839 has a calculated molecular weight of approximately 9,719 Dalions and an 
estimated pl of approximately 4.67. Other feanires of PR0839 include a signal peptide at about amino acids 1- 
23, potential protein kinase C phosphorylation sites at about amino acids 37-39 and about amino acids 85-87. 

25 a potential casein kinase II phosphorylation site at about amino acids 37-40. sequence identity with ribonucleotide 
reductase large subunit protem at about amino acids 50-60, and sequence identity with eukaryotic RNA-binding 
region RNP-1 proteins at about amino acids 70-79. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 101 (SEQ ID NO: 167), evidenced some 

30 homology between the PR0839 amino acid sequence and the following Dayhoff sequences: CD14_M0USE, 
XPR6_YARU, HS714385^1. S49783, BB19^RABrr, GVPH-HALME. AB003135^1. P^R85453, 
LUU27081_2, and TP2B_MOUSE. 

Clone DNA56859-1445 was deposited with the ATCC on June 23, 1998, and is assigned ATCC deposit 
no.209019. 

35 
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EXAMPLE 43: Isolation of cDNA Clones E ncoding Human PROl 180 

Use of the signal sequence algorithm described in £xanq>le 3 above allowed identification of a single 
Incyte EST cluster sequence (Incyte EST cluster sequence no. 14732). The Incyte EST cluster sequence no. 
14732 sequence was then compared to a variety of expressed sequence tag (EST) databases which included public 
EST databases (e.g. , GenBank) and a proprietary EST DN A database (LIFESEQ™, Incyte Pharmaceuticals, Palo 
S Alto, CA) to identify existing homologies. The homology search was performed using the computer program 
BLAST or BLAST2 (Altshul et al. , Methods in Enzvmology 266 :460-480 (1996)). Those comparisons resulting 
in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DN A sequence with the program "phrap** (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA557n. 

10 In light of an observed sequence homology between the DNA5571 1 consensus sequence and an EST 

sequence encompassed within the Merck EST clone no. T6098U the Merck EST clone T60981 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insen encoded a fiilMength protein. 
The sequence of this cDNA insen is ^own in Figure 102 and is herein designated DNA56860-1510. 

The full length clone shown in Figure 102 contained a single open reading frame with an apparent 

1 5 translational initiation site at nucleotide positions 78-80 and ending at the stop codon found at nucleotide positions 
909-911 (Figure 102; SEQ ID NO: 168). The predicted polypeptide precursor is 277 amino acids long, has a 
calculated molecular weight of approximately 31,416 daltons and an estimated pi of approximately 8.88. 
Analysis of the full-length PROl 180 sequence shown in Figure 103 (SEQ ID NO: 169) evidences the presence 
of the following: a signal peptide from about amino acid 1 to about amino acid 23, a leucine zipper pattern 

20 sequence from about amino acid 10 to about amino acid 3 1 . and potential N-myristolation sited from about amino 
acid 64 to about amino acid 69, from about amino acid 78 to about amino acid 83, from about amino acid 80 
to about amino acid 85, from about amino acid 91 to about amino acid 96 and from about amino acid 201 to 
about amino acid 206. Clone DNA56860-1510 has been deposited with the ATCC on June 9. 1998 and is 
assigned ATCC deposit no. 209952. 

25 Analysis of the amino acid sequence of the full-length PROl 180 polypeptide suggests that it possesses 

sequence similarity to the methyltransferase family of proteins. More specifically, an analysis of the Dayhoff 
database (version 3S.4S SwissProt 35) evidenced some degree of homology between the PROl 180 amino acid 
sequence and the following Dayhoff sequences, MTCI65_I4, D69267, YH09_YEAST, BIOC:_SERMA, 
ATAC00448415T1D16.16, SHGCPIR_18, SPBC3B9^4, AB009504^14, P^W17977 and A69952. 

30 

EXAMPLE 44: Isolation of cDNA clones Encoding Human PROl 134 

Use of die signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 751 1. This EST cluster sequence was dien compared to 
a variety of expressed sequence tag (EST) databases M^hich included public EST databases (e.g., GenBank) and 
35 a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
ci al.. Methods in Enzvmoloev 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
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in some cas« 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington. Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA55725. Two proprietary Genentech EST 
sequences were employed in the assembly and are shown in Figure 106 (SEQ ID NO: 172) and Figure 107 (SEQ 
ID NO: 173). 

5 In light of an observed sequence homology between the DNA55725 consensus sequence and an EST 

sequence encompassed within the Merck EST clone no. H94897. the Merck EST clone H94897 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 106 and is herein designated as DNA5686S*1491. 

Clone DNA56865-149I contains a single open reading frame with an ^parent translational initiation 

10 site at nucleotide positions 153-155 and ending at the stop codon at nucleotide positions 1266-1268 (Figure 104). 
The predicted polypeptide precursor is 371 amino acids long (Figure 105). The fiilMcngth PROl 134 protein 
shown in Figure 105 has an estimated molecular weight of about 4 1 ,935 daltons and a pi of about 9.58. Analysis 
of the full-length PR01134 sequence shown in Figure 105 (SEQ ID NO: 171) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 23. potential N-glycosylation sites from 

15 about amino acid 103 to about amino acid 106. from about amino acid 249 to about amino acid 252 and from 
about amino acid 257 to about amino acid 260, and an amino acid block having homology to tyrosinase CuA- 
binding region proteins from about amino acid 280 to about amino acid 306. Clone DNA56865-1491 has been 
deposited with ATCC on June 23. 1998 and is assigned ATCC deposit no. 203022. 

An analysis of the Dayhoff database (version 35,45 SwissProi 35), using a WU-BLAST-2 sequence 

20 alignment analysis of the full-length sequence shown in Figure 105 (SEQ ID NO: 171), evidenced significant 
homology between the PROl 134 amino acid sequence and the following Dayhoff sequences: F20P5_18, 
AC002396^10, S47847, C64146. GSPA^BACSU, P^W10564, RFAI^ECOLI, Y258_HAE1N. RFAJ^SALTY 
and P_R32985. 

25 EXAMPLE 45: Isolation of cDNA clones Encoding Human PR083Q 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
chistcr sequence from the Incytedatabase, designated 2025 1 . This EST cluster sequence was then compared to 
a variety of expressed sequence tag (EST) databases which included public EST databases (e.g.. GenBank) and 
a proprietaiy EST DNA database (LIFESEQ* Incyte Pharmaceuucals, Palo Alto, CA) to identify existing 

30 homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
«^ Methods in Enzvmolopv 266:460480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green. University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA55733. 

35 In light of an observed sequence homology between the DNA55733 consensus sequence and an EST 

sequence encompassed within the Merck EST clone no. H78534, the Merck EST clone H78534 was purchased 
and the cDNA insen was obiamed and sequenced. It was found that this insert encoded a fiill-length protein. 
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The sequence of chb cDNA inseit is shown in Figure 108 and is herein designated as DNA56866-1342. 

Clone DNA56866-1342 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 154-156 and ending at the stop codon at nucleotide positions 415-417 (Figure 108). 
The predicted polypeptide precursor is 87 amino acids long (Figure 109). The full-length PRO830 protein shown 
in Figure 109 has an estimated molecular weight of about 9,272 daltons and a pi of about 9. 19. Analysis of the 
5 full-length PRO830 sequence shown in Figure 109 (SEQ ID NO: 175) evidences the presence of ihe following: 
a signal peptide from about amino acid 1 to atom amino acid 33, potential N-myristoyiation sites from about 
amino acid 2 to about amino acid 7 and from about amino acid 8 to about amino acid 13 and a thioredoxin family 
of proteins homology block from about amino acid 23 to about amino acid 39. Clone UN(}470 (DNA56866- 
1342) has been deposited with ATCC on June 22, 1998 and is assigned ATCC deposit no. 203023. 
10 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 

alignment analysis of the full-length sequence shown in Figure 109 (SEQ ID NO: 175), evidenced significant 
homology between the PRO830 amino acid sequence and the following Dayhoff sequences: HSU88154J. 
HSU88153_1, SAPKSGENE_1, HPU3179I_5. GGCN0T2^1, CPU91421_1, CHKESTPC09^1 , PQ0769. 
U97553^79 and B60095. 

15 

EXAMPLE 46: Isolation of cDNA clones Encoding Human PROl 1 15 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the UFESEC^ database, designated Incyie EST cluster sequence no. 165008. This EST 
cluster sequence was then compared to a variety of expressed sequence tag (EST) databases which included 

20 public EST databases (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte 
Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was performed using 
the computer program BLAST or BLAST2(Altshulet ah. Methods in Enzvmology 266:460480 ( 1996)). Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
proteins were clustered and assembled into a consensus DNA sequence with the program ^'phrap" (Phil Green, 

25 University of Washington, Seattle. Washington). The consensus sequence obtained therefrom is herein 
designated DNA55726. 

In light of an observed sequence homology between the DNA55726 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. R75784, the Merck EST clone R75784 was purchased 
and the cDNA insen was obtained and sequenced. It was found that this insen encoded a full-length protein. 

30 The sequence of diis cDNA insen is shown in Figure 1 1 1 and is hereui designated as DNA56868-1478. 

The full length clone shown in Figiu^ 110 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 189-191 and ending at the stop codon found at nucleotide 
positions 1524-1526 (Figure 110; SEQ ID NO: 176). The predicted polypeptide precursor (Figure 111. SEQ 
ID NO: 177) is 445 amino acids long. PROl 1 15 has a calculated molecular weight of approximately 50.533 

35 Daltons and an estimated pl of approximately 8.26. Additional feamres include a signal peptide at about amino 
acids 1-20; potential N-glycosylation sites at about amino acids 204-207, 295-298, and 313-316; and putative 
transmembrane domams at about amino acids 35-54, 75-97. 126-146. 185-204. 333-350, and 353-371. 
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An analysis of the Dayhoff database (version 35.45 SwissProi 35). using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 1 1 1 (SEQ ID NO: 177), evidenced some amino 
acid sequence identity between the PR01115 amino acid sequence and the following Dayhoff sequences: 
AF053947^79. S73698. CEC47AiO_4, CC0MTNDS5G^1 . HS4LMP2AC_1. LMP2^EBV. PA24_^MOUSE. 
HCU33331_7. P.W05508, and AF002273_1. 
5 Clone DNA56868- 1478 was deposited with the AtCC on June 23 . 1998 and is assigned ATCC deposit 

no. 203024.. 

EXAMPLE 47 : Isolation of cDNA clones Encoding Human PRO 1277 

A consensus DNA sequence was assembled relative to other ESTs using repeated cycles of BLAST and 
10 the program '"phrap" as descrited in Example 1 above. One or more of the ESTs from the assembly was 
derived from diseased coronary artery tissue. The consensus sequence obtained is designated herein as 
•*DNA49434-. 

In light of an observed sequence homology between the DNA49434 consensus sequence and an EST 
sequence encompassed within die Incyte EST clone no. 3042605, the Incyte EST clone 3042605 was purchased 
15 and the cDNA insen was obtained and sequenced. It was found that this insen encoded a full-length protein. 
The sequence of this cDNA insen is shown in Figure 112 (SEQ ID NO: 178). 

Clone DNA56869-1545 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 188-190, and an apparent stop codon at nucleotide positions 2222-2224 (Figure 1 12). 
The predicted polypeptide precursor is 678 amino acids long (Figure 1 13). The full-length PR01277 protein 
20 shown in Figure 113 has an estimated molecular weight of about 73,930 daltons and a pi of about 9.48. 
Additional features include a signal peptide at about amino acids 1-26; a transmembrane domain at about amino 
'acids 181-200. and potential N-glycosylation sites at about amino acids 390-393 and 520-523. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 113 (SEQ ID NO: 179), revealed significant 
25 homology between the PR01277 amino acid sequence and Dayhoff sequence no AF012252_1 . Homology was 
also found between the PR01277 amino acid sequence and die following Dayhoff sequences: AF006740_1, 
CA36_HUMAN, HSUl^l, HUMC0L7A1X_I, CA17^HUMAN, MMZ78163_1. CAMA^CHICK, 
HSU69263_I, YNX3^CAEEL, and MMRNAM3^1. 

Clone DNA56869-1545 has been deposited with ATCC and is assigned ATCC deposit no. 203161. 

30 

EXAMPLE 48 : Isolation of cDNA Clones Encoding Human PR01135 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein desigiiated DNA52767. Based on the DNA52767 
consensus sequence, oligonucleotides were synthesized: 1) to idenuiy by PCR a cDNA library diat contained 
35 the sequence of interest, and 2) for use as probes to isolate a clone of the fiill-lengdi coding sequence for 
PROI135. 
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In order to screen several libraries for a source of a fiill-length clone, DNA from the libraries was 
screened by PCR amplification with PCR primer pairs prepared based upon the DN A52767 sequence. A positive 
library was then used to isolate clones encoding the PROl 135 gene using the probe oligonucleotide and one of 
the PCR primers. RNA for constriiction of the cDNA libraries was isolated from human coronary anery smooth 
muscle tissue (LIB309). The cDNA libraries used to isolate the cDNA clones were constructed by standard 
5 methods using commercially available reagents such as diose from Invitrogen, San Diego, CA. The cDNA was 
primed with oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, 
sized appropriately by gel electrophoresis, and cloned in a defmed orientation into a suitable cloning vector (such 
as pRKB or pRKD; pRK5B is a precursor of pRK5D that does not contain the Sfil site; see. Holmes et al.. 
Science . 253:1278-1280 (1991)) in the unique Xhol and Noll sites. 
10 DNA sequencing of die clones isolated as described above gave the fulMength DNA sequence for 

PR01135 [herein designated as DNA56870-1492] (S£Q ID NO: 180) and die derived protein sequence for 
PROn35. 

The entire nucleotide sequence of DNA56870-1492 is shown in Figure 1 14 (SEQ ID NO: 180). Clone 
DNA56870-1492 contains a single open reading frame with an apparent translational initiation site at nucleotide 

IS positions 62-64 and ending at the stop codpn at nucleotide positions 1685-1687 (Figure 1 14). The predicted 
polypeptide precursor is 541 amino acids long (Figure 1 15). The full-length PROl 135 protein shown in Figure 
1 15 has an estimated molecular weight of about 60,335 dalions and a pi of about 5.26. Analysis of die full- 
length PROU35 sequence shown in ¥igm 115 (SEQ ID NO: 181) evidences the presence of die following: a 
signal peptide from about amino acid 1 to about aino acid 21 , potential N-glycosylation sited from about amino 

20 acid 53 to about amino acid 56, from about amino acid 75 to about amino acid 78, from about amino acid 252 
to about amino acid 255 and from about amino acid 413 to about amino acid 416 and an amino acid block having 
homology to glycosyl hydrolase family 35 proteins from about amino acid 399 to about amino acid 414. Clone 
DNA56870-1492 has been deposited widi ATCC on June 2, 1998 and is assigned ATCC deposit no. 209925. 
Analysis of the amino acid sequence of the fiill-lengih PROl 135 polypeptide suggests that it possesses 

25 significant sequence similarity to the alpha 1,2-mannosidase protein, thereby indicating that PROl 135 may be 
a novel mannosidase. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced significam homology between die PROl 135 amino acid sequence and the following Dayhoff 
sequences, DMC86E4_5, 086967^1, SPAC23A1_4. YH04^YEAST,B54408,SSMAN9MAN^1, CEZC410_4, 
S61631 andMSU14190J. 

30 

EXAMPLE 49: Isolation of cDNA Clones Encoding Human PROl 1 14 

A cDN A sequence isolated in the amylase screen described in Example 2 above was found, by the WU- 
BLAST-2 sequence alignment computer program, to have certain sequence identity to other known interferon 
receptors. ThiscDNAsequenceishereindesignatedDNA48466andisshowninFigure 118(SEQIDNO:184). 
35 Based on die sequence identity, probes were generated from the sequence of die DN A48466 molecule and used 
to screen a human breast carconoma libraiy (LIB135) prepared as described in paragraph 1 of Example 2 above. 
The cloning vector was pRK5B (pRK5B is a precursor of pRIC5D that does not contain the Sfil site; see. Holmes 
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et al.. Science , ^:I278-1280 (1991)), and the cDNA size cut was less than 2800 bp. 

The oligonucleotide probes employed were as follows: 
forward PGR primer 5'-AGGCTTCGCTGCGACTAGACCTC>3' (SEQ ID NO: 185) 
reverse PCR primer S'-CCAGGTCGGGTAAGGATGGTTGAG-3' (SEQ ID NO: 186) 
hybridization probe 

5 5'-TTTCTACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCrrGC-3* (SEQ ID NO: 187) 
A full length clone was identified that contained a single open reading frame with an apparent 
cranslational initiation site at nucleotide positions 250-252, and a stop signal at nucleotide positions 1 183-1 185 
(Figure 1 16, SEQ ID NO: 182). The predicted polypeptide precursor is 3 1 1 amino acids long, has a calculated 
molecular weight of approximately 35,076 daltons and an estimated pi of ^proximately 5.04. Analysis of the 

10 full-length PRO 1 1 14 interferon receptor sequence shown in Figure 1 1 7 (SEQ ID NO: 183) evidences the presence 
of the following: a signal peptide from about amino acid 1 to about amino acid 29, a transmembrane domain 
from about amino acid 230 to about amino acid 255, potential N-glycosylation sites from about amino acid 40 
to about amino acid 43 and from about amino acid 134 to about amino acid 137, an amino acid sequence block 
having homology to tissue factor proteins from about amino acid 92 to about amino acid 1 19 and an amino acid 

15 sequence block having homology to integrin alpha chain proteins from about amino acid 232 to about amino acid 
262. Clone DNA57033-1403 has been deposited with ATCC on May 27, 1998 and is assigned ATCC deposit 
no. 209905. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35). using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 1 17 (SEQ ID NO: 183), evidenced significant 
20 homology between the PROl 1 14 interferon receptor amino acid sequence and the following Dayhoff sequences: 
G01418. INR1_M0USE, P_R71035, INGS.HUMAN. A26595_l, A26593_I, 156215 andTF^HUMAN. 

EXAMPLE 50 : Isolation of cDNA Clones Encoding Human PR0828 

A consensus DNA sequence was identified using the method described in Example I above. This 
25 consensus sequence is herein designated DNA35717. Based on the DNA35717 consensus sequence, 

oligonucleotides were synthesized: I) to identify by PCR a cDNA library that contained the sequence of interest, 

and 2).for use as probes to isolate a clone of the full-length coding sequence for PR0828. 
PCR primers (forward and reverse) were synthesized: 

forward PCR Primer 5'-GCAGGACTTCTACGACTTCAAGGC-3' (SEQ ID NO: 190); and 
30 reverse PCR primer 5*-AGTCTGGGCCAGGTACTTGAAGGC.3' (SEQ ID NO: 191). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA35717 

sequence which had the following nucleotide sequence: 

hvbridization probe 

5*-CAACATCCGGGGCAAACTGGTGTC(XrrGGAGAAGTACCGCGGATCGGTGT-3* (SEQ ID NO:192) 
35 In order to screen several libraries for a source of a fiill-lengih clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR0828 gene using the probe oligonucleotide and one of the PCR primers. RNA 



418 



wo 99/63088 



PCTAJS99/12252 



for construction of the cDNA libraries was isolated from human fetal lung tissue (LIB25). 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR0828 [herein designated as DNAS7037-1444] (SEQ ID NO: 188) and the derived protein sequence for 
PR0828. 

The entire nucleotide sequence of DNA57037-1444 is shown in Figure 1 19 (SEQ ID NO: 188). Clone 
5 DNA57037-1444 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 34-36 and ending at the stop codon at nucleotide positions 595-597 (Figure 119). The predicted 
polypeptide precursor is 187 amino acids long (Figure 120). The full-length PR0828 protein shown in Figure 
120 has an estimated molecular weight of about 20,996 daltons and a pi of about 8.62. Analysis of the full- 
length PR0828 sequence shown in Figure 120 (SEQ ID NO: 189) evidences the presence of the following: a 

10 signal peptide at about amino acids 1- 21; sequences identity to glutathione peroxidases signature 2 at about 
amino acids 82-89; sequence identity to glutathione peroxidases selenocysteine proteins at about amino acids 35- 
60, 63-100, 107-134, and 138-159. Clone DNA57037-1444 has been deposited widi ATCC on May 27, 1998, 
and is assigned ATCC deposit no. 209903. 

Analysis of the amino acid sequence of the fuU-lengdi PR0828 polypeptide suggests that it possesses 

15 significant sequence similarity to glutathione peroxidases, thereby indicating that PR0828 may be a novel 
peroxidase enzyme. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced sequence identity between the PR0828 amino acid sequence and die following Dayhoff sequences: 
AF053311^1, CELT09A12_2. AC004151_3, BTUE.ECOLI, CER05H10J, P^P80918, PWU88907^1. and 
P_W22308, 

20 

EXAMPLE 51 : Isolation of cDNA clones Encoding Human PRO1009 

A cDNA clone (DNA57129-1413) encoding a native human PRO1009 polypeptide was identified by 

the use of a yeast screen, in a hiunan SK-Lu-1 adenocarcinoma cell line cDNA library that preferentially 

represents the 5' ends of the primary cDNA clones. First SEQ ID NO: 195 (Figure 123) was identified, which 
25 was extended by alignments to other EST sequences to form a consensus sequence. Oligonucleotide probes 

based upon the consensus sequence were synthesized and used to screen the cDN A library which gave rise to 

the full-length DNA57129-I413 clone. 

The full length DNA57129-1413 clone shown in Figure 121 contained a single open reading frame with 

an apparent translational initiaticm site at nucleotide positions 41-43 and endmg at the stop codon found at 
30 nucleotide positions 1886-1888 (Figure 121; SEQ ID NO:193). The predicted polypeptide precursor (Figure 

122, SEQ ID NO: 194) is 615 amino acids long. Figure 122 also shows the approximate locations of the signal 

sequence, transmembrane domains, myristoylation sites, a glycosylation site and an AMP-binding domain. 

PRO1009 has a calculated molecular weight of approximately 68,125 daltons and an estimated pi of 

approximately 7.82. Clone DNA57129-1413 has been deposited with ATCC and is assigned ATCC deposit no. 
35 209977 . It is understood that the deposited clone has the acnial and correct sequence and that the representations 

herein may have minor, normal sequencing errors. 
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Based on a WU-BLAST-2 sequence alignment analysis (using the ALIGN computer program) of the 
full-length sequence, PRO1009 shows amino acid sequence identity to at least the following proteins which were 
designated in a Dayhoff database as follows: F69893, CEF28F8J, BSY13917_7. BSY13917_7, D69I87, 
D69649, XCRPFB^l. E64928, YDID_ECOU. BNACSFS^l and RPU75363J. 

5 EXAMPLE 52 : Isolation of cDNA Clones Encoding Human PRO10Q7 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated as DNA4067L 

In light of an observed sequence homology between the DNA40671 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. T70Si3, the Merck EST clone T70S13 was purchased 
10 and the cDNA insen was obtained and sequenced. It was found that this insen encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 124. 

The entire nucleotide sequence of DNA57690-1374 is shown in Figure 124 (SEQ ID NO: 196). Clone 
DNAS7690-1374 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 16-18 ami ending at the stop codon at nucleotide positions 1054-1056 (Figure 124). The predicted 
15 polypeptid^precursor is 346 amino acids long (Figure 125). The fiill-lengdi PRO1007 protein shown in Figure 
125 has an estimated molecular weight of about 35»971 daltons and a pl of about 8. 17. Clone DNA57690-1374 
has been deposited with the ATCC on June 9, 1998. Regarding the sequence, it is understood that the deposited 
clone contains the actual sequence, and the sequences provided herein are based on known sequencing 
techniques. The representative figures herein show the representative numbering. 
20 Analysis of the amino acid sequence of the full-length PRO 1007 polypeptide suggests that portions of 

it possess sequence identity to MAGPIAP, thereby indicating that PRO 1007 may be a novel member of the 
family to which MAGPIAP belongs. . 

Still analyzing the amino acid sequence of SEQ ID NO: 191, the putative signal peptide is at about amino 
acids 1-30 of SEQ ID NO: 197. The uansmembrane domain is at amino acids 325-346 of SEQ ID NO: 197. N- 
25 glycosylaiion sites are at about amino acids 118-121, 129-132, 163-166. 176-179, 183-186 and 227-130of SEQ 
ID NO: 197. Ly-6/u-Par domain protein homology is at about amino acids 17-36 and 209-222 of SEQ ID 
NO: 197. The corresponding nucleotides of the amino acids presented herein can be routinely determined given 
the sequences provided herein. 

30 EXAMPLE 53: Isolation of cDNA clones Encoding Human PRO 1056 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated herein as 6425. This EST cluster sequence was then 
conqiared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., 
GenBank) and a proprietary EST DNA daubase (Lifeseq^, Incyte Pharmaceuticals, Palo Alto, CA) to identify 

35 existing homologies. The homology search was performed using the computer program BLAST or BLAST2 
(Altshul et al.. Methods in Enzvmolopv 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 
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consensus DNA sequence with the program •'phrap" (Phil Green, University of Washington, Seattle. 
Washington). The consensus sequence obtained iherefrora is herein designated DNA55736. 

In light of an observed sequence homology between the DNA55736 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. R88049, the Merck EST done R88049 was purchased 
and the cDNA insen was obtained and sequenced. It was found that this insen encoded a full-length protein. 

5 The sequence of this cDNA insert is shown in Figure 126 and is herein designated as DNA57693-1424. 

Clone DNA57693-1424 contains a single open reading frame with an apparent translaiional initiation 
site at nucleotide positions 56-58 and ending at the stop codon at nucleotide positions 416^18 (Figure 126). The 
predicted polypeptide precursor is 120 amino acids long (Figure 127). The full-length PRO1056 protein shown 
in Figure 127 has an estimated molecular weight of about 13,345 daltons and a pi of about 5. 18. Analysis of 

10 the fiilWenglh PRO1056 sequence shown in Figure 127 (SEQ ID NO: 199) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 1 8, a transmembrane domain from about 
amino acid 39 to about amino acid 58. a potential N-glycosylation site from about amino acid 86 to about amino 
acid 89, protein kinase C phosphorylation sites from about amino acid 36 to about amino acid 38 and from about 
amino acid 58 to about amino acid 60, a tyrosine kinase phosphorylation site from about amino acid 25 to about 

IS amino acid 32 and an amino acid sequence block having homology to channel forming coiicin proteins from 
about amino acid 24 to about amino acid 56. Clone DNA57693-1424 has been deposited with ATCC on June 
23, 1998 and is assigned ATCC deposit no. 203008. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 127 (SEQ ID NO: 199), evidenced significant 

20 homology between the PRO1056 amino acid sequence and the following Dayhoff sequences: PLM HUMAN, 
A40533, ATNG_HUMAN, A55571, ATNG^SHEEP, S31524, GEN13025, RIC_MOUSE, A48678 and 
A10871^1. 

EXAMPLE 54: Isolation of cDNA clones Encoding Human PR0826 

25 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated 47283. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

30 et al., Methods in Enzvmologv 266:460480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein ctesignated DNA56000. 

In light of an observed sequence homology between the DN A56000 consensus sequence and an EST 

35 sequence encompassed within the Merck EST clone no. W69233. the Merck EST clone W69233 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insen encoded a full-length protein. 
The sequence of this cDNA insen is shown in Figure 128 and is herein designated as DNA57694-1341. 
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Clone DNA57694>1341 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 13-15 and ending at the stop codon at nucleotide positions 310-312 (Figure 128). The 
predicted polypeptide precursor is 99 amino acids long (Figure 129). The full-length PR0826 protein shown 
in Figure 129 has an estimated molecular weight of about 11,050 daltons and a pi of about 7.47. Analysis of 
the full-length PR0826 sequence shown in Figure 129 (SEQ ID NO:201) evidences the presence of the 
5 following: a signal peptide from about amino acid 1 to about amino acid 22, potential N-myristoylaiion sites from 
about amino acid 22 to about amino acid 27 and from about amino acid 90 to about amino acid 95 and an amino 
acid sequence block having homology to peroxidase from about amino acid 16 to about amino acid 48. Clone 
DNA57694-1341 has been deposited with ATCC on June 22, 1998 and is assigned ATCC deposit no. 203017. 
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
10 alignment analysis of the full-length sequence shown in Figure 129 (SEQ ID NO:201), evidenced significant 
homology between the PR0826 amino acid sequence and the following Dayhoff sequences: CCU12315_1, 
SCU96108^6, CELF39F10_4 and HELT^HELHO. 

EXAMPLE 55: Isolation of cDNA clones Encoding Human PROS 19 

15 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated 49605. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ®, Incyie Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

20 et al.. Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resuhing in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program **phrap** (Phil Green, University of Washington, Seattle. Washington). The 
consensus sequence obtained therefrom is herein designated DNA56015. 

In light of an observed sequence homology between the DNA56015 consensus sequence and an EST 

25 sequence encompassed within the Merck EST clone no. H65785, the Merck EST clone H65785 was purchased 
and the cDNA insen was obtained and sequenced. It was foimd that this insen encoded a fulMength protein. 
The sequence of this cDNA insen is shown in Figure 130 and is herein designated as DNA57695-1340. 

Clone DNA57695-1340 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 46-48 and ending at the stop codon at nucleotide positions 202-204 (Figure 130). The 

30 predicted polypeptide precursor is 52 amino acids long (Figure 131). The full-length PR0819 protem shown 
in Figure 131 has an estimated molecular weight of about 5,216 daltons and a pi of about 4.67. Analysis of the 
full-length PR0819 sequence shown in Figure 131 (SEQ ID NO:203) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 24, a potential N-myristoylation site from about 
amino acid 2 to about amino acid 7 and a region haying homology to immunoglobulin light chain from about 

35 amino acid 5 to about amino acid 33. Clone DNA57695-1340 has been deposited with ATCC on June 23. 1998 
and is assigned ATCC deposit no. 203006. 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the iiill-length sequence shown in Figure 131 (SEQ ID NO:203), evidenced significant 
homology between the PR0819 amino acid sequence and the following Dayhoff sequences: HSU03899_1. 
HUMIGUTEB^l, VG28_HSVSA, AF031522^l. PADl^YEAST and AF045484^1. 

5 EXAMPLE 56 : Isolation of cDNA Clones Encoding Human PRO1006 

An initial candidate sequence from Incyte cluster sequence no. 45748 was identified using the signal 
algorithm process described in Example 3 above. This sequence was then aligned with a variety of public and 
Incyte EST sequences and a consensus sequence designated herein as DNA56036 was derived therefrom. 

In light of an observed sequence homology between the DNA56036 consensus sequence and an EST 
10 sequence encompassed within the Merck EST clone no. 489737, the Merck EST clone 489737 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown m Figure 132. 

The entire nucleotide sequence of DNA57699-1412 is shown in Figure 132 (SEQ ID NO:204). Clone 
DNA57699-I412 contains a single open reading frame with an apparent iranslational initiation site at nucleotide 
15 positions 28-30 and ending at the stop codon at nucleotide positions 1204-1206 (Figure 132). The predicted 
polypeptide precursor is 392 amino acids long (Figure 133). The full-length PR01(X)6 protein shown in Figure 
133 has an estimated molecular weight of about 46,189 daltons and a pi of about 9.04. Clone DNA57699-1412 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
the correct sequence, and the sequences provided herein are based on known sequencing techniques. 
20 Analyzing the amino acid sequence of SEQ ID NO:205, the putative signal f^ptide is at about amino 

acids 1-23 of SEQ ID NO:205. The N-glycosylation sites are at about amino acids 40-43. 53-56, 204-207 and 
373-376 of SEQ ID NO:205. An N-myristoylation site is at about amino acids 273-278 of SEQ ID NO:205. 

The corresponding nucleotides of these amino acid regions and others can be routinely determined given the 
sequences provided herein. 

25 

EXAMPLE 57: Isolation of cDNA Clones Encoding Human PRQ1112 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a specific 
EST cluster sequence. This EST cluster sequence was then compared to a variety of expressed sequence tag 
(EST) databases which included public EST databases (e.g., GenBank) and a proprietary EST DNA database 
30 (LIFESEQ^, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search was 
performed using the conq)uter program BLAST or BLAST2 (Altshul et al.. Methods in Enzvmoloev 266:460- 
480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not 
encode known proteins were clustered and assembled into a consensus DNA sequence with the program ''phrap'* 
(Phil Green, University of Washington, Seattle, Washington). The consensus sequence obtained therefrom is 
35 herein designated DNA56018. 

In light of an observed sequence homology between the DNA56018 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. AA223546. the Merck EST clone AA223546 was 
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purchased and the cDNA insert was obtained and sequenced. It was found that this insen encoded a fiill-length 
protein. The sequence of this cDNA insert is shown in Figure 134 and is herein designated as DNA57702-1476. 

The entire nucleotide sequence of DNA57702-1476 is shown in Figure 134 (SEQ ID NO:206), Clone 
DNA57702-1476 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 20-22 and ending at the stop codon at nucleotide positions 806-808 of SEQ ID NO:206 (Figure 134). 
5 The predicted polypeptide precursor is 262 amino acids long (Figure 135). The fiili-length PROl 1 12 protein 
shown in Figure 135 has an estimated molecular weight of about 29,379 daltons and a pl of about 8.93. Figure 
135 also shows the approximate locations of the signal peptide and transmembrane domains. Clone DNA57702- 
1476 has been deposited with the ATCC on June 9, 1998. It is understood chat the deposited clone has the acmal 
nucleic acid sequence and that the sequences provided herein are based on known sequencing techniques. 

10 Analysis of the amino acid sequence of the full-length PROl 1 12 polypeptide suggests that h possesses 

some sequence similarity to other proteins. More specifically, an analysis of the Dayhoff database (version 
35.45 SwissProt 35) evidenced some sequence identity between the PROl 1 12 amino acid sequence and at least 
the following Dayhoff sequences, MTY20B11_13 (a mycobacterium mberculosis peptide), F64471, 
AE000690_6, XLU16364_1, E43259 (H+ -transporting ATP synthase) and PIGSLADRXE_1 (MHC class II 

IS histocompatibility antigen). 

EXAMPLE 58: Isolation of cDNA clones Encoding Human PRO 1074 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence (Incyie cluster sequence No. 42586). This cluster sequence was then compared to 

20 a variety of expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and 
a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al.. Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 

25 DNA sequence with the program "phrap" (Phil Green, Univ. of Washington, Seattle, Washington). The 
consensus sequence obtamed therefrom is herein designated DNA56251. 

In light of an observed sequence homology between the DNA56251 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. AA081912, the Merck EST clone AA081912 was 
purchased and the cDNA insen was obtained and sequenced. It was found that this insen encoded a full-length 

30 protein. The sequence of this cDNA insen is shown in Figure 136 and is the fiili-length DNA sequence for 
PROI074. Clone DNA57704-1452 was deposited with the ATCC on June 9, 1998, and is assigned ATCC 
deposit no. 209953. 

The entire nucleotide sequence of DNA57704-1452 is shown in Figure 136 (SEQ ID NO:208). Clone 
DNA57704-1452 contains a single open reading frame with an apparent translational initiation site at nucleotide 
35 positions 322-324 and ending at the stop codon at nucleotide positions 1315-1317 (Figure 136). The predicted 
polypeptide precursor is 331 ammo acids long (Figure 137). The full-length PRO 1074 protein shown in Figure 
137 has an estimated molecular weight of about 39,512 Daltons and a pi of about 8.03. Analysis of the fiill- 
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length PRO1074 sequence shown in Figure 137 (SEQ ID NO:209) evidences the presence of the following 
feanires: a transmembrane domain at about amino acids 20 to 39; potential N-glyccsylation sites at about amino 
acids 72 to 75. 154 to 157, 198 to 201 , 212 to 215, and 326 to 329; a glycosaminoglycan attachment site at about 
amino acids 239 to 242, and a Ly-6/u-PAR domain at about amino acids 23 to 36. 

Analysis of the amino acid sequence of the fiill-lengih PRO1074 polypeptide suggests thai it possesses 

5 significant sequence similarity to beta 1 ,3-galactosyltransferase, thereby indicating that PRO 1074 may be a novel 
member of the galactosyltransferase family of protems. Analysis of die amino acid sequence of die full-length 
PRO 1074 polypeptide using die Dayhoff database (version 35.45 SwissProi 35) evidenced homology between 
the PRO 1074 amino acid sequence and die following Dayhoff sequences: AF029792_1. P_R57433. 
DMU41449^I. AC000348^14, P_R47479. CET09F5_2. CEF14B6_4, CET15D6^5. CEC54C8^4. and 

10 CEE03H4^10. 

Clone DNA57704-1452 was deposited with die ATCC on June 9. 1998, and is assigned ATCC deposit 
no. 209953. 

EXAMPLE 59: Isolation of cDNA clones Encoding Human PRO 1005 

15 Use of die signal sequence algoridun described in Example 3 above allowed identification of an EST 

cluster sequence from the LIFESEQ* database, Incyte cluster sequence no. 49243. This EST cluster sequence 
was dien compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®. Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 

20 BLAST2 (Altshul ei al.. Methods in Enzvmology 266:460>480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater tfiat did noi encode known proteins were clustered and 
assembled into a consensus DNA sequence with die program "phrap" (Phil Green. University of Washington, 
Seattle, Washington). The consensus sequence obtained dierefrom is herein designated DNA56380. 

In light of an observed sequence homology between the DNA56380 consensus sequence and an EST 

25 sequence encompassed widiin die Merck EST clone no. AA256657, die Merck EST clone AA256657 was 
purchased and die cDNA insen was obtained and sequenced. It was found diat diis insen encoded a fiill-lengdi 
protein. The sequence of diis cDN A insen is shown in Figure 1 38 and is herein designated as DN A57708- 1411. 

The foil length clone shown in Figure 138 contained a single open reading frame widi an apparent 
translational initiation site at nucleotide positions 30-32 and ending at die stop codon found at nucleotide positions 

30 585-587 (Figure 138; SEQ ID NO:210). The predicted polypeptide precursor (Figure 139, SEQ ID N0:2I1) 
is 185 amino acids long. PRO1005 has a calculated molecular weight of approximately 20.331 dalions and an 
estimated pi of approximately 5.85. Clone DNA57708-141 1 was deposited widi the ATCC June 23. 1998, and 
is assigned ATCC deposit no. 203021 . 

An analysis of die Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

35 alignment analysis of die foll-lengdi sequence shown in Figure 139 (SEQ ID NO:211), evidenced some 
homology between die PRO1005 amino acid sequence and die following Dayhoff sequences: DDU07I87_1, 
DDU87912_1, CELDI007^14, A42239, DDU42597_K CYAG.DICDL S50452, MRKC^KLEPN, P-R41998, 
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andXYNA^RUMFL. 

EXAMPLE 60 : Isolation of cDNA clones Encoding Human PRO 1073 

An initial DNA sequence referred to herein as DNA55938 and shown in Figure 142 (SEQ ID NO:214) 
was identified using a yeast screen, in a human SK-Lu-1 adenocarcinoma cell line cDNA library that 
5 preferentially represents the 5* ends of the primary cDNA clones. DN A55938 was then compared to ESTs from 
public databases (e.g., GenBank), and a proprietary EST database (LIFESEQ®, Incyte Pharmaceuticals, Palo 
Alto, CA), using the computer program BLAST or BLAST2 [ Altschul ei al. . Methods in Enzvmolo^. 266:460- 
480 (1996)]. The ESTs were clustered and assembled into a consensus DNA sequence using the computer 
program *phrap** (Phil Green, University of Washington, Seanle, Washington). The consensus sequence 

10 obtained is designated herein as DNA5641 1. 

In light of an observed sequence homology between the DNA5641 1 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H86027, the Merck EST clone H86027 was purchased 
and the cDNA insen was obtained and sequenced. It was foimd that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 140. 

15 The fiill length DNA57710-14S1 clone shown in Figure 140 contained a single open reading frame with 

an apparent translational initiation site at nucleotide positions 345-347 and ending at the stop codon found at 
nucleotide positions 1242-1244 (Figure 140; SEQ ID NO:212). The predicted polypeptide precursor (Figtu-e 
141. SEQ ID NO:213) is 299 amino acids long. PRO1073 has a calculated molecular weight of approximately 
34,689 daltons and an estimated pi of approximately 11.49. The PRO 1073 polypeptide has the following 

20 additional feamres: a signal peptide at about amino acids 1-31, sequence identity to bZIP transcription factor 
basic domain signature at about amino acids, a potential N-glycosylaiion site at about amino acids 2-5, and 
sequence identity with protamine PI proteins at about amino acids 158-183. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST-2 sequence 
alignment analysis of the full-length sequence shown in Figure 14 i (SEQ ID NO:213), revealed some sequence 

25 identity between the PRO1073 amino acid sequence and the following Dayhoff sequences: MMU37351_1, 
ATAC00250510T9J22.10. S59043, ENXNUPR^l, B47328, SR55^DROME, S26650, SON^HUMAN. 
Vrn^CHICK, and XLC4SRPRT^1. 

Clone DNA57710-1451 was deposited with the ATCC on July 1 , 1998 and is assigned ATCC deposit 
no. 203048. 

30 

EXAMPLE 61:Isolation of cDNA clones Encoding Human PRO 1 152 

A cDNA clone (DNA5771 1-1501) cncodmg a native human PR01152 polypeptide was identified by 
employing a yeast screen, in a human infant brain cDNA library that preferentially represents the 5* ends of the 
primary cDNA clones. Specifically, a yeast screen was employed to idemify a cDNA designated herein as 
35 DNA55807 (SEQ ID N0:217; see Figure 145). 

In light of an observed sequence homology between the DNA55807 sequence and an EST sequence 
encompassed within the Merck EST clone no. R56756, the Merck EST clone R56756 was purchased and the 
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cDNA insert was obtained and sequenced. It was found that this insen encoded a full-length protein. The 
sequence of this cDNA insen is shown in Figure 143. 

The full-length DNA5771 1-1501 clone shown in Figure 143 contains a single open reading frame with 
an apparent translational initiation site at nucleotide positions 58-60 and ending at the stop codon at nucleotide 
positions 1495-1497 (Figure 143). The predicted polypeptide precursor is 479 amino acids long (Figure 144). 
5 The full-length PROl 152 protein shown in Figure 144 has an estimated molecular weight of about 53,602 daltons 
and a pi of about 8.82. Analysis of die full-length PROl 152 sequence shown in Figure 144 (SEQ ID NO:216) 
evidences the presence of the following: a signal peptide from about amino acid 1 to about amino acid 28 « 
transmembrane domains from about amino acid 133 to about amino acid 155, from about amino acid 168 to 
about amino acid 187, from about amino acid 229 to about amino acid 247, from about amino acid 264 to about 
10 amino acid 285, from about amino acid 309 to about amino acid 330, from about amino acid 371 to about amino 
acid 390 and from about amino acid 441 to about amino acid 464, potential N-glycosylation sites from about 
amino acid 34 to about amino acid 37 and from about amino acid 387 to about amino acid 390 and an amino acid 
sequence block having homology to a respiratory-chain NADH dehydrogenase subimit from about amino acid 
243 to about amino acid 287. Clone DNA5771 1-1501 has been deposited with ATCC on July 1 , 1998 and is 
IS assigned ATCC deposit no. 203047. 

An analysis of die Dayhoff database (version 35.45 SwissProt 35). using a WU-BLAST-2 sequence 
alignment analysis of the full-lengdi sequence shown in Figure 144 (SEQ ID NO:216), evidenced significant 
homology between the PROl 152 amino acid sequence and the following Dayhoff sequences: AF052239_1, 
SYNN9CGA_1. SFCYTB2_1, GEN12507, P,R11769. MTV025_109. C61168, S43171, P_P61689 and 
20 P^P61696. 

EXAMPLE 62 : Isolation of cDNA clones Encoding Human PROl 136 

Use of die signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from die Incyte database, designated 109142. This EST cluster sequence was dien compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., (jenBank) 
and a proprietary EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using die computer program BLAST or BLAST2 (Altshul 
et al.. Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence widi the program ''phrap'* (Phil Green, University of Washington, Seaule, Washington). The 
consensus sequence obtained dierefrom is herein designated DNA56039. 

In light of an observed sequence homology between die DN A56039 consensus sequence and an EST 
sequence encompassed widiin die Merck EST clone no. HSCINFOI 1, die Merck EST clone HSCINFOI 1 was 
purchased and die cDNA insen was obtained and sequenced. It was found diat diis insen encoded a full-lengdi 
protein. The sequence of diis cDNA insen is shown in Figure 146 and is herein designated as DNA57827-1493. 

Clone DNA57827-1493) contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 216-218 and ending at die stop codon at nucleotide positions 21 12-21 14 (Figure 146). 
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The predicted polypeptide precursor is 632 amino acids long (Figure 147). The full-length PROl 136 protein 
shown in Figure 147 has an estimated molecular weight of about 69,643 daltons and a pi of about 8.5. Analysis 
of the full-length PROl 136 sequence shown in Figure 147 (SEQ ID NO:219) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 15 and {Wtcniial N-glycosylation sites 
from about amino acid 108 to about amino acid 1 1 , from about amino acid 157 to about amino acid 160, from 
5 about amino acid 289 to about amino acid 292 and from about amino acid 384 to about amino acid 387. Clone 
DNA57827-1493 has been deposited with ATCC on July 1 , 1998 and is assigned ATCC deposit no. 203045. 

An analysis of die Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 147 (SEQ ID NO:219), evidenced significant 
homology between the PROl 136 amino acid sequence and the following Dayhoff sequences: AF034746 1. 
10 AF034745_1, MMAF000168J9, HSMUPP1_1. AF060539_1. SP97_RAT. 138757. MMU93309_1, 
CEK01A6J1 and HSA224747_1. 

EXAMPLE 63: Isolation of cDNA clones Encoding Human PR0813 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

15 Incyie EST cluster sequence (Incyte EST cluster sequence no. 45501. The Incyte EST cluster sequence no. 
4550 1 sequence was then compared to a variety of expressed sequence ug (EST) databases which included public 
EST databases (e.g. , GenBank) and a proprietary EST DNA database (LIFESEQ ™, Incyte Pharmaceuticals, Palo 
Alio» CA) to identify existing homologies. The homology search was performed using the computer program 
BLAST or BLAST2 (Altshul ei al. , Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting 

20 in a BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, 
Seattle, Washington), The consensus sequence obtained therefrom is herein designated DNA56400. 

In light of an observed sequence homology between the DN A564(X) consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. T90592, the Merck EST clone T90592 was purchased 

25 and the cDNA insen was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insen is shown in Figure 148 and is herein designated DNA57834-1339. 

The full length clone shown in Figure 148 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 109-111 and eitding at the stop codon foimd at nucleotide 
positions 637-639 (Figure 149; SEQ ID NO:221). The predicted polypeptide precursor is 176 amino acids long. 

30 has a calculated molecular weight of approximately 19,616 daltons and an estimated pl of approximately 7.11. 
Analysis of the full-length PR0813 sequence shown in Figure 149 (SEQ ID NO:221) evidences the presence of 
the following: a signal peptide from about amino acid 1 to about amino acid 26 and potenual N-myristoylation 
sites from about amino acid 48 to about amino acid 53, from about amino acid 153 to about amino acid 158, 
from about amino acid 156 to about amino acid 161 and from about amino acid 167 to abom amino acid 172. 

35 Clone DNA57834-1339 has been deposited with the ATCC on June 9^ 1998 and is assigned ATCC deposit no. 
209954. 
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Analysis of the amino acid sequence of the full-length PR0813 polypeptide suggests that it possesses 
sequence similarity to the pulmonary surfactani-associaicd protein C. More specifically, an analysis of the 
Dayhoff database (version 35.45 SwissProt 35) evidenced some degree of homology tetween the PR0813 amino 
acid sequence and the following Dayhoff sequences. PSPC^MUSVI, P^P9207 1 , G02964. P_R65489. P^P82977, 
P_R84555. S55542, MUSIGHAJ^l and PH1158. 

5 

EXAMPLE 64 : Isolation of cDNA Clones Encoding Human PRO809 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST duster sequence. The Incyte EST cluster sequence was then compared to a variety of expressed 
sequence tag (EST) databases which included public EST databases (e.g, . GenBank) and a proprietary EST DNA 

10 database (LIFESEQ"^, Incyte Pharmaceuticals, Palo Alto. CA) to identify existing homologies. The homology 
search was performed using the computer program BLAST or BLAST2 ( Altshul et al. . Methods in Enzvmolocv 
266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater 
that ^d not encode known proteins were clustered and assembled into a consensus DNA sequence widi the 
program "phrap" (Phil Green, University of Washington, Seattle. Washington). The consensus sequence 

15 obtained therefrom is herein designated DNA56418. 

In light of an observed sequence homology between the DNA56418 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. H74302, the Merck EST clone H74302 was purchased 
and the cDNA insen was obtained and sequenced. It was found that this insen encoded a full-length protein. 
The sequence of this cDNA insen is shown in Figure 150 and is herein designated DNA57836-1338. 

20 The entire nucleotide sequence of DNA57836-1338 is shown in Figure 150 (SEQ ID NO:222). Clone 

DNA57836-1338 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 63-65 and ending at the stop codon at nucleotide positions 858-860 of SEQ ID NO:222 (Figure 150). 
The predicted polypeptide precursor is 265 amino acids long (Figure 151). The full-length PRp809 protein 
shown in Figure 15 1 has an estimated molecular weight of about 29,06 1 daltons and a pi of about 9. 18. Figure 

25 151 further shows the approximate positions of the signal peptide and N-glysosylation sites. The corresponding 
nucleotides can be determined by referencing Figure 150. Clone DNA57836-I338 has been deposited with 
ATCC on June 23, 1998. h is understood that the deposited clone has the actual nucleic acid sequence and that 
the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of die full-length PRO809 polypeptide suggests that it possesses 

30 some sequence similarity to the heparin sulfate proteoglycan and to endothelial cell adhesion molecule- 1 . More 
specifically, an analysis of the Dayhoff daubase (version 35.45 SwissProt 35) evidenced sequence identity 
between the PRO809 amino acid sequence and the following Dayhoff sequences, PGBM_MOUSE, D82082_l 
and PW14158. 

35 EXAMPLE 65: Isolation of cDNA Clones Encoding Human PR0791 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence. The Incyte EST cluster sequence was then compared to a variety of expressed 
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sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary EST DN A 
database (UFESEQ™, Incyie Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology 
search was performed using the computer program BLAST or BLAST2 ( Alishul et al . , Methods in Enzvmoiogy 
266:460-480 (1^6)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
5 program *'phrap** (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56429. 

In light of an observed sequence homology between the DNA56429 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 36367, the Merck EST clone 36367 was purchased and 
the cDNA insert was obtained and sequenced. It was found that this insert encoded a fiilMength protein. The 

10 sequence of this cDNA insert is shown in Figure 152 and is herein designated DNA57838-1337. 

The entire nucleotide sequence of DNA57838-I337 is shown in Figure 152 (SEQ ID NO:224). Clone 
DNA57838-1337 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 9-11 and ending at the stop codon at nucleotide positions 747-749 of SEQ ID NO:224 (Figure 152). 
The predicted polypeptide precursor is 246 amino acids long (Figure 153). The full-length PR0791 protein 

15 shown in Figure 153 has an estimated molecular weight of about 27,368 dahons and a pi of about 7.45. Figure 
153 also shows the approximate locations of the signal peptide, the transmembrane domain, N-glycosylation 
sites and a region conserved in extracelltdar proteins. The corresponding nucleotides of one embodiment 
provided herein can be identified by referencing Figure 152. Clone DNA57838-1337 has been deposited with 
ATCC on June 23, 1998. It is understood that the deposited clone has the actual nucleic acid sequence and that 

20 the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-lengih PR0791 polypeptide suggests that it has sequence 
similarity with MHC-I antigens, thereby indicating that PR0791 may be related to MHC-I antigens. More 
specifically, an analysis of the Dayhoff database (version 35.45 SwissProi 35) evidenced some sequenc identity 
between the PR0791 amino acid sequence and the following Dayhoff sequences, AF034346_1 , MMQ1K5_1 and 

25 HFE^HUMAN. 

EXAMPLE 66: Isolation of cDNA clones Encoding Human PRO10()4 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence, Incyte cluster sequence No. 73681 , This EST cluster sequence was then compared 

30 to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
to identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Alishul et al.. Methods in Enzvmoiogy 266:460^80 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequence with the program "phrap" (Phil Green, Univ. of Washington, Seattle, 

35 Washington). The consensus sequence obtained therefrom is herein designated as DNA56S16. 

In li^i of an observed sequence homology between the DNA56516 consensus sequence and an EST 
sequence encompassed within die Merck EST clone no. H43837, the Merck EST clone H43837 was purchased 
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and the cDNA insen was obtained and sequenced. It was found that this insert encoded a ftiU-length protein. 
The sequence of this cDNA insert is shown in Figure 154. 

The ftiU length clone shown in Figure 154 contained a single open reading frame with an apparent 
translational iniiiaiion site at nucieodde positions 1 19-121 and ending ai the stop codon at nucleotide positions 
464-466 (Figure 154; SEQ ID NO:226). The predicted polypepude precursor is 1 15 amino acids long (Figure 
5 155; SEQ ID NO:227). The full-length PRO1004 protein shown in Figure 155 has an estimated molecular 
weight of about 13.649 daltons and a pi of about 9.58. Analysis of the full-length PRO1004 sequence shown 
in Figure 155 (SEQ ID NO:227) evidences the presence of the following features: a signal peptide at about amino 
acids 1-24, a microbodies C-tenninal targeting signal at about amino acids 1 13-1 15. a potential N-glycosylation 
site at about amino acids 71-74, and a domain having sequence identity with dihydrofolaie reductase proteins at 

10 about amino acids 22-48. 

Analysis of the amino acid sequence of the fiill-lengih PRO1004 polypeptide using the Dayhoff database 
(version 35.45 SwissProt 35) evidenced homology between the PRO 1004 amino acid sequence and the following 
Dayhoff sequences: CELR02D3J, LECLMOUSE, AF006691.3, SSZ97390.1, SSZ97395_1, and 
SSZ97400,1. 

15 Clone DNA57844-1410.was deposited with the ATCC on June 23. 1998, and is assigned ATCC deposit 

no. 203010. 

EXAMPLE 67: Isolation of cDNA clones Encoding H uman PRO 1 1 1 1 

An expressed sequence tag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto. CA) 
20 was searched and an EST was identified which had homology to insulin-like growth factor binding protein. 

RN A for construction of cDN A libraries was isolated from human fetal brain. The cDN A libraries used 
to isolate the cDN A clones encoding human PRO 1111 were constructed by standard methods using commercially 
available reagents such as diose from Invitrogen. San Diego, CA. The cDNA was primed with oligo dT 
containing a NotI site, linked with blunt to Sail hcmildnased adaptors, cleaved with NotI, sized appropriately 
25 by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or 
pRKD; pRK5B is a precursor of pRK5D that does not contein the Sfil site; see. Holmes et al.. Science , 
253:1278-1280 (1991)) in the unique Xhol and NoU. 

The human fetal brain cDNA libraries (prepared as described above), were screened by hybridization 
with a synthetic oligonucleotide probe based upon the Incyte EST sequence described above: 
30 5'-CCACCACCTGGAGGTCCTGCAGTTGGGCAGG AACTCCATCCGGC AGATTG-3' (SEQ ID NO:25 1). 

An identified cDNA clone was sequenced in entirety. The entire nucleotide sequence of PROl 1 1 1 is 
shown in Figure 156 (SEQ ID NO:228). Clone DNA58721-1475 contains a single open readmg frame with an 
apparent translational initiation site at nucleotide positions 57-59 and a stop codon at nucleotide positions 2016- 
2018 (Figure 156; SEQ ID NO:228). The predicted polypepude precursor is 653 amino acids long (Figure 157). 
35 The transmembrane domains are at positions 21-40 (type II) and 528-548. Clone DNA5872M475 has been 
deposited with ATCC and is assigned ATCC deposit no. 203110. The full-length PROUll protein shown in 
Figure 157 has an estimated molecular weight of about 72,717 daltons and a pl of about 6.99. 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown m Figure 157 (SEQ ID NO:229). revealed some sequence 
identity between the PROl 1 1 1 amino acid sequence and the following Dayhoff sequences: A58532, D86983 1. 
RNPLGPV^l, PGS2_HUMAN. AF038i27_l, ALS^MOUSE, GPV_HUMAN, PGS2^BOVIN. ALS.PAPPA 
and 147020. . 

EXAMPLE 68 : Isolation of cDNA clones Encoding Human PRO 1344 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA33790. Based on the DNA33790 
consensus sequence, oligonucleotides were synthesized: 1) to idenufy by PGR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR01344. 

PGR primers (forward and reverse) were synthesized: 
forward PGR primer S'^ACjCrrTrnTnATnnAnArAA.rrnrn.^* (SEQ ID NO:232) 
reverse PGR primer 5'.TGTGAAGGAGGGAGTGGCGTGATG.3' (SEQ ID NO:233) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA33790 
sequence which had the following nucleotide sequence 
hybridization probe 

5'-TGGCGAGATCATCAAGGGTGTCTGTGGGAAGGAGCGGCCAGCTGGTATGG-3' (SEQ ID NO:234) 
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PGR amplification with the PGR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR01344 gene using the probe oligonucleotide and one of the PGR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PROI344 (designated herein as DNA58723-1588 [Figure 158. SEQ ID NO:230]); and the derived protein 
sequence for PR01344. 

The entire nucleotide sequence of DNA58723-1588 is shown in Figure 158 (SEQ ID NO:230). Clone 
DNA58723-I588 contains a single q>en reading frame with an apparent translational initiation site at nucleotide 
positions 26-28 and ending at the stop codon at nucleotide positions 2186-2188 (Figure 158). The predicted 
polypeptide precursor is 720 amino acids long (Figure 159). The full-length PR01344 protein shown in Figure 
159 has an estimated molecular weight of about 80,199 daltons and a pi of about 7.77. Analysis of the full- 
length PR01344 sequence shown in Figure 159 (SEQ ID NO:231) evidences the presence of the following: a 
signal peptide from abcnit amino acid 1 to about amino acid 23, an EGF-like domain cysteine protein signanire 
sequence from about amino acid 260 to about amino acid 271 , potential N-glycosylation sites from about amino 
acid 96 to about amino acid 99, from about amino acid 279 to about amino acid 282, from about amino acid 316 
to about amino acid 319, from about amino acid 451 to about amino acid 454 and from about amino acid 614 
to about amino acid 6 1 7, an amino acid sequence block having homology to serine proteases, trypsin family from 
about amino acid 489 to about amino acid 505 and a GUB domain protein profile sequence from about amino. 
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acid 150 to about amino acid 166. Clone DNA58723-1588 has been deposited with ATCC on August 18. 1998 
and is assigned ATCC deposit no. 203133. 

An analysis of the Dayhoff database (version 35.45 SwissProi 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 159 (SEQ ID NO:231). evidenced significani 
homology between the PRO 1344 amino acid sequence and the following Dayhoff sequences: S77063_l, 
5 CRAR^MOUSE, P_R74775, P_P90070. P_,R09217. P_P70475. HSBMP16,1 and U50330^1. 

EXAMPLE 69: Isolation of cDNA clones Encoding Human PRO 1 109 

A consensus DNA sequence was assembled relative to odier EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA52642. The consensus DNA sequence 

10 was obtained by extending using repeated cycles of BLAST and phrap a previously obtained consensus sequence 
as far as possible using the sources of EST sequences discussed above. Based on the DNA52642 consensus 
sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 
of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PROl 109. 
PCR primers (forward and reverse) were synthesized: 

15 forward PCR primer 5'-CCTTACCTCAGAGGCCAGAGCAAGC-3' (SEQ ID NO:237) 
reverse PCR primer 5'-GAGCTTCATCCGTTCTGCGTTCACC-3* (SEQ ID NO:238) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA52642 
sequence which had the following nucleotide sequence 
hvbridization probe 

20 5'-CAGGAATGTAAAGCTTTACAGAGGGTCGCCATCCTCGTTCCCCACC.3' (SEQ ID NO:239) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROl 109 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human SK-Lu- 1 adenocarcinoma cell tissue (LIB247). 

25 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PROl 109 (designated herein as DNA58737-1473 [Figure 160. SEQ ID NO:235]) and the derived protein 
sequence for PROl 109. 

The entire nucleotide sequence of DNA58737-1473 is shown in Figure 160 (SEQ ID NO:235). Clone 
DNA58737-1473 contains a single open reading frame with an apparent translational initiation site at nucleotide 

30 positions 1 19-120 and ending at the stop codon at nucleotide positions 1 15 M 153 (Figure 160). The predicted 
polypeptide precursor is 344 amino acids long (Figure 161). The full-length PROl 109 protein shown in Figure 
161 has an estimated molecular weight of about 40,041 dalions and a pi of about 9.34. Analysis of the full- 
length PROl 109 sequence shown in Figure 161 (SEQ ID NO:236) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 27, potential N-glycosylaiion sites from about amino 

35 acid 4 to about amino acid 7, from about amino acid 220 to about amino acid 223 and from about amino acid 
335 to about amino acid 338 and an amino acid sequence block having homology to xylose isomerase proteins 
from about amino acid 191 to about amino acid 201. Clone DNA58737-1473 has been deposited with ATCC 
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on August 18, 1998 and is assigned ATCC deposit no. 203136. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 161 (SEQ ID NO:236), evidenced significant 
homology between the PRO1109 amino acid sequence and the following Dayhoff sequences: HSUDPGAL_1, 
HSUDPB14_1, NALS^BOVIN, HSU10473_1. CEW02B12^11, YNJ4^CAEEL. AE000738^11, CET24D1_1, 
5 S48121andCEGLY9^1. 

EXAMPLE 70 : Isolation of cDNA clones Encoding Human PRO 1383 

A consensus DN A sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is herein designated DNA53961. Based on the DNA5396] 
10 consensus sequence, oligonucleotides were synthesized: 1) to identify by PGR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR01383. 

PGR primers (forward and reverse) were synthesized: 
forward PGR primer 5 -CATTTGCnTAGGGTGGAGGGAG<rrGC-3' (SEQ ID NO:242) 
15 reverse PGR primer 5'-GAAA(3(jGGGAGAGGAGATCTGGCAG-3' (SEQ ID NO:243) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNAS3961 
sequence which had the following nucleotide sequence 
hvbridization probe 

5'.CCAGGAGGGGAGCAAGTTGCTCAAGAGGGAGTTGTTTCTCTACAGC-3' (SEQ ID NO:244) 
20 In order to screen several libraries for a source of a full-lengih clone, DNA from the libraries was 

screened by PGR amplification with the PGR primer pair identified above. A positive library was then used to 
isolate clones encoding the PR01383 gene usmg the probe oligonucleotide and one of the PGR primers. RNA 
for consmiction of the cDNA libraries was isolated from human fetal brain tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
25 PR01383 (designated herein as DNA58743-1609 (Figure 162, SEQ ID NO: 240]) and the derived protein 
sequence for PR01383. 

The entire nucleotide sequence of DNA58743-1609 is shown in Figure 162 (SEQ ID NO:240). Glone 
DNA58743-1609 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 122-124 and ending at the stop codon at nucleotide positions 1391-1393 (Figure 162). The predicted 

30 polypeptide precursor is 423 amino acids long (Figure 163). The full-length PR01383 protein shown in Figure 
163 has an estimated molecular weight of about 46,989 daltons and a pi of about 6.77. Analysis of the full- 
length PR01383 sequence shown in Figure 163 (SEQ ID NO:241) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 24, a transmembrane domain from about amino acid 
339 to about amino acid 362, and potential N-glycosylation sites from about amino acid 34 to about amino acid 

35 37, from about amino acid 58 to about amino acid 61 , from about amino acid 142 to about amino acid 145, from 
about amino acid 197 to about amino acid 200, from about amino ^id ZOO to about amino acid 303 and from 
about amino acid 364 to about amino acid 367. Glone DNA58743-1609 has been deposited with ATGG on 



434 



wo 99/63088 



PCT/US99/12252 



August 25, 1998 and is assigned ATCC dq>osii no. 203154. 

An analysis of the Dayhoff database (version 35.45 SwissProi 35)» using a WU-BLAST2 sequence 
alignment analysis of the fiill-lengih sequence shown in Figure 163 (SEQ ID NO:241), evidenced significant 
homology between the PR01383 amino acid sequence and the following Dayhoff sequences: NMB^HUMAN, 
QNR^COTJA, P^W38335, P115_CHICK, P^W38164, A45993^l. MMU70209^l, D83704_l and P_W39176. 

5 

EXAMPLE 71 : Isolation of cDNA Clones Encoding Human PRO 1003 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence designated herein as 43055. This sequence was then compared to a variety of EST 
databases which included public EST daubases (e.g., GenBank) and a proprietary EST DNA database 

10 (UFESEQ™. Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The homology search 
was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in Enzvmologv 
266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 

1 5 obtained therefrom is herein designated consenOl . 

In light of an observed sequence homology between the consensus sequence and an EST sequence 
encompassed within the Incyte EST clone no. 2849382, the Incyte EST clone 2849382 was purchased and the 
cDNA insen was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 164. 

20 The entire nucleotide sequence of DNA58846-1409 is shown in Figure 164 (SEQ ID NO:245). Clone 

DNA58846-1409 contains a single open reading frame with an apparent translation^ initiation site at nucleotide 
positions 41-43 and ending at the stop codon at nucleotide positions 293-295 (Figure 164). The predicted 
polypeptide precursor is 84 amino acids long (Figure 165). The fulMength PRO1003 protein shown in Figure 
165 has an estimated molecular weight of about 9,408 dahons and a pi of about 9.28. Analysis of the full-length 

25 PRO 1003 sequence shown in Fi]gure 165 (SEQ ID NO:246) evidences the presence of a signal peptide at amino 
acids 1 to about 24, and a cAMP- and cGMP-dependent protein kinase phosphorylation site at about amino acids 
58 to about 61 . Analysis of the amino acid sequence of the full-length PRO1003 polypeptide using the Dayhoff 
database (version 35.45 SwissProt 35) evidenced homology between the PRO 1003 amino acid sequence and the 
following Dayhoff sequences: AOPCZA363J. SRTX_ATREN, A48298, MHVJHMS^l, VGL2^CVMJH, 

30 DHDHTC2J, CORT^RAT, TAL6_HUMAN, P__W14123, and DVUFIJ. 

EXAMPLE 72: Isolation of cDNA Clones Encoidine Human PRO1108 

A consensus DNA sequence was assembled relative to other EST sequences using phnip as described 
in Example 1 above. This consensus sequence is herein designated DNA53237. 
35 In light of an observed sequence homology between the DNA53237 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 2379881, the Incyte EST clone 2379881 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
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The sequence of this cDNA insen is shown in Figure 166 and is herein designated DNA58848-1472. 

The entire nucleotide sequence of DNA58848-1472 is shown in Figure 166 (SEQ ID NO:247). Clone 
DNA58848-1472 contains a single open reading frame with an apparent translaiional initiation site at nucleotide 
positions 77-79 and ending at the stop codon at nucleotide positions 1445-1447 (Figure 166). The predicted 
polypeptide precursor is 456 amino acids long (Figure 167). The fiilMength PRO 1 108 protein shown in Figure 
167 has an estimated molecular weight of about 52,071 daltons and a pi of about 9.46. Analysis of the full- 
length PROl 108 sequence shown in Figure 167 (SEQ ID NO:248) evidences the presence of the foUowingiiype 
II transmembrane domains from about amino acid 22 to about amino acid 42, from about amino acid 156 to 
about amino acid 176, from about amino acid 180 to about amino acid 199 and from about amino acid 369 to 
about amino acid 388, potential N-glycosylaion sites from about amino acid 247 to about amino acid 250, from 
about amino acid 327 to about amino acid 330, from about amino acid 328 to about amino acid 331 and from 
about amino acid 362 to about amino acid 365 and an amino acid block having homology to ER lumen protein 
retaining receptor protein from about amino acid 153 to about amino acid 190. Clone DNA58848-1472 has been 
deposited with ATCC on June 9, 1998 and is assigned ATCC deposit no. 209955. 

Analysis of the amino acid sequence of the full-length PROl 108 polypepude suggests that it possesses 
significant sequence similarity to the LPAAT protein, thereby mdicating that PROl 108 may be a novel LPAAT 
homolog. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced 
significant homology between the PRO 1108 amino acid sequence and the following Dayhoff sequences, 
AF01581L1, CER07E3_2, YL35_CAEEL, S73863, CEF59F4_4. P^W06422, MMU4I736_1, MTV008J9, 
P^R99248 and Y67_BPT7. 

EXAMPLE 73 : Isolation of cDNA Clones Encoding Human PROl 137 

The extracellular domain (ECD) sequences (including the secretion signal, if any) of from about 950 
known secreted proteins from the Swiss-Prot public protein database were used to search expressed sequence 
tag (EST) databases. The EST databases included public EST databases (e.g., GenBank) and a proprietary EST 
DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA). The search was performed using the 
computer program BLAST or BLAST2 (Alishul et al.. Methods in Enzvmologv 266:460-480 (1996)) as a 
comparison of the ECD protein sequences to a 6 frame u-anslation of the EST sequence. Using this procedure, 
incyte EST No. 3459449, also referred to herein as "DNA7108''. was identified as an EST having a BLAST 
score of 70 or greater that did not encode a known protein. 

A consensus DNA sequence was assembled relative to the DNA7108 sequence and other ESTs using 
repealed cycles of BLAST and die program •'phrap" (Phil Green, Univ. of Washington, Seattle, WA). The 
consensus sequence obtained therefrom is referred to herein as DNA539S2. 

In light of an observed sequence homology between the DNA53952 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone.no. 3663102, the Incyte EST clone 3663102 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a fiill-lenglh protein. 
The sequence of this cDNA insert is shown in Figure 168. 



436 



wo 99/63088 



PCTAJS99/122S2 



The eniire nucleotide sequence of DNA58849-1494 is shown in Figure 168 (SEQ ID NO:249). Clone 
DNA58849-1494 contains a single open reading frame with an apparent translaiional initiation site at nucleotide 
positions 77-79 and ending at the stop codon at nucleotide positions 797-799 (Figure 168). The predicted 
polypeptide precursor is 240 amino acids long (Figure 169). The full-length PROl 137 protein shown in Figure 
169 has an estimated molecular weight of about 26,064 daltons and a pi of about 8.65. Analysis of the fiill- 
5 length PROl 137 sequence shown in Figure 169 (SEQ ID NO: 250) evidences the presence of a signal peptide 
at about amino acids 1 to 14 and a potential N-glycosylation site at about amino acids 101-105. 

Analysis of the amino acid sequence of the iull-length PROl 137 polypeptide suggests that it possesses 
significant sequence similarity to ribosyltransferase thereby indicating that PROl 137 may be a novel member 
of the ribosyltransferase family of proteins. Analysis of the amino acid sequence of the full-length PROl 137 
10 polypeptide using the Dayhoff database (version 35.45 SwissProi 35) evidenced homology between the PROl 137 
amino acid sequence and the following Dayhoff sequences: MMART5_l, NARG^MOUSE, GEN11909. 
GEN13794, GEN14406, MMRNART62_,1. and P_R41876. 

EXAMPLE 74: Isolation of cDNA clones Encoding Human PRO 11 38 

15 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

incyte EST sequence, Incyte cluster sequence no. 165212. This cluster sequence was then compared to a variety 
of expressed sequence tag (EST) daubases which included public EST databases (e.g., GenBank) and a 
proprietary EST DNA database (LIFESEQ™. Incyte Pharmaceuiicals, Palo Alio, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Alishul 

20 ei al.. Methods in Enzvmologv 266:460480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program **phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated as DN A54224. The assembly included a proprietary 
Genentech EST designated herein as DNA49140 (Figure 172; SEQ ID NO:254). 

25 In light of an observed sequence homology between the DNA54224 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3836613, the Incyte EST clone 3836613 was purchased 
and the cDNA insen was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 170 and is the full-length DNA sequence for PROl 138. 
Clone DNA58850-1495 was deposited with the ATCC on June 9, 1998, and is assigned ATCC deposit no. 

30 209956. 

The entire nucleotide sequence of DNA58850-1495 is shown in Figure 170 (SEQ ID NO:252). Clone 
DNA58850-1495 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 38-40 and ending at the stop codon at nucleotide positions 1043-1045 (Figure 170). The predicted 
polypeptide precursor is 335 amino acids long (Figure 171), The full-length PROl 138 protein shown in Figure 
35 171 has an estimated molecular weight of about 37,421 Dalions and a pi of about 6.36. Analysis of the full- 
length PROl 138 sequence shown in Figure 171 (SEQ ID NO:253) evidences the presence of the following 
features: a signal peptide at about amino acid 1 to about amino acid 22; a transmembrane domain at about amino 
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acids 224 to about 250; a leucine zipper pattern at about amino acids 229 to about 250; and potential N- 
glycosylation sites at about amino acids 98.101, 142-145, 148-151, 172-175, 176-179, 204-207, and 291-295. 

Analysis of the amino acid sequence of the fuU-lengih PROl 138 polypeptide suggests that it possesses 
significant sequence similarity to the CD84, thereby indicatmg that PROl 138 may be a novel member of the Ig 
superfamily of polypeptides. More panicularly , analysis of the amino acid sequence of the full-length PRO 1 138 
5 polypeptide using the Dayhoff database (version 35.45 SwissProt 35) evidenced homology between the PROl 138 
amino acid sequence and the following Dayhoff sequences: HSU82988_,1 . HUMLY9^1 , P^R9763 1 , P^R97628, 
P_R97629, P^R97630, CD48_RAT, CD2^HUMAN, P^P93996, and HUMBGP.l. 

Clone DNA58850.1495 was deposited with ATCC on June 9, 1998. and is assigned ATCC deposit no. 

209956. 

10 

EXAMPLE 75: Isolation of cPNA clones E ncoding Human PRO1054 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 66212. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
15 and a proprietary EST DNA database (LIFESEQ*. Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al.. Methods in Enzvmologv 266:460^80 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
*m some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
20 consensus sequence obtained therefrom is herein designated DNA55722. 

In light of an observed sequence homology between the DNA55722 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 319751, the Incyte EST clone 319751 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 173 and is herein designated as DNA58853-1423. 
25 Clone DNA58853-1423 contains a single open reading frame with an apparent translational initiation 

site at nucleotide positions 46-48 and ending at the stop codon at nucleotide positions 586-588 (Figure 173). The 
predicted polypeptide precursor is 180 amino acids long (Figure 174). The full-length PRO1054 protein shown 
in Figure 174 has an estimated molecular weight of about 20,638 daltons and a pi of about 5.0. Analysis of the 
fiilHength PRO 1054 sequence shown in Figure 174 (SEQ ID NO:256) evidences the presence of the foUowing: 
30 a signal peptide from about amino acid I to about amino acid 18, a leucine zipper pattern from about amino acid 
155 to about ammo acid 176 and amino acid sequence blocks having homology to lipocalin proteins from about 
amino acid 27 to about amino acid 38 and from about amino acid 110 to about amino acid 120. Clone 
DNA58853-1423 has been deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203016. 
An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
35 alignment analysis of the full-length sequence shown in Figure 174 (SEQ ID NO:256), evidenced significant 
homology between the PRO 1054 amino acid sequence and the following Dayhoff sequences: MUPl^MOUSE, 
MUP6_M0USE. MUP2_M0USE, MUP8,MOUSE, MUP5.M0USE, MUP4^MOUSE. S10124, 
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MUPM^MOUSE. MUP^RAT and ECU70823^1. 

EXAMPLE 76 : Isolation of cDNA clones Encoding Human PR0994 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 157555. This EST cluster sequence was then compared 
5 to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto. CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al. , Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 

10 DNA sequence with the program *"phrzp"* (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNAS5728. 

In light of an observed sequence homology between the DNAS5728 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2860366, the Incyte EST clone 2860366 was purchased 
and the cDNA insen was obtained and sequenced. It was found that this insert encoded a fulMength protem. 

IS The sequence of this cDNA insen is shown in Figure 175 and is herein designated as DNA58855-1422. 

Clone DNA58855-1422 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 31-33 and ending at the stop codon ai nucleotide positions 718-720 (Figure 175). The 
predicted polypeptide precursor is 229 amino acids long (Figure 176). The full-length PR0994 protein shown 
in Figure 176 has an estimated molecular weight of about 25,109 daltons and a pi of about 6.83. Analysis of 

20 the full-length PR0994 sequence shown in Figure 176 (SEQ ID NO:258) evidences the presence of the 
following: transmembrane domains from about amino acid 10 to about amino acid 31, from about amino acid 
50 to about amino acid 72, from about amino acid 87 to about amino acid 1 10 and from about amino acid 191 
to about amino acid 213, potential N-glycosylation sites from about amino acid 80 to about amino acid 83, from 
about amino acid 132 to about amino acid 135, from about amino acid 148 to about amino acid 151 and from 

25 about amino acid 163 to about amino acid 166 and an amino acid block having homology to TNFR/NGFR 
cysteine-rich region proteins from about amino acid 4 to about amino acid 1 1 . Clone DNA58855- 1422 has been 
deposited with ATCC on June 23, 1998 and is assigned ATCC deposit no. 203018. 

An analysis of the Dayhoff daubase (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 176 (SEQ ID NO:258), evidenced significant 

30 homology between the PR0994 amino acid sequence and the following Dayhoff sequences: AF027204_1, 
TAL6_HUMAN. ILT4_HUMAN, JC6205. MMU57570_1. S40363, ETU56093_1. S42858, P^R66849 and 
P_R74751. 

EXAMPLE 77: Isolation of cDNA clones Encoding Human PR0812 
35 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated 170079. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 



8NS0OCID <WO_ 396^mA2 i > 



wo 99/63088 



PCTAJS99/122S2 



and a proprietary EST DNA database (Ufeseq* Incyte Pharmaceuticals. Palo Alto. CA) to identify existing 
homologies. The homology search was performed tising the comptuer program BLAST or BLAST2 ( Alishul 
et al- Methods in Enzvmdogv 266:46(M80 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program "phrap" (Phil Green. University of Washington. SeatUe, Washington). The 
consensus sequence obtained therefrom is herein designated as DNA55721. 

In light of an observed sequence homology between die DNA55721 consensus sequence and an EST 
sequence encompassed within the facyte EST clone no. 388964. the Incyte EST clone 388964 was purchased 
and the cDNA insen was obtained and sequenced. It was found that this insert encoded a fiill-length protein. 
The sequence of this cDNA insert is shown in Fipire 177 and is herein designated as DNA59205-1421 . 

Clone DNA59205-1421 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 55-57 and ending at die stop codon at nucleotide positions 304-306 (Figure 177). The 
predicted polypeptide precursor is 83 amino acids long (Figure 178). The full-length PR0812 protein shown 
in Figure 178 has an estimated molecular weight of about 9.201 daltons and a pl of about 9.3. Analysis of die 
fiill-lengdi PR0812 sequence shown in Figure 178 (SEQ ID NO:260) evidences die presence of die following: 
a signal peptide from about amino acid 1 to about amino acid 15. a cAMP- and cGMP-dependent protein kinase 
phosphorylation site from about amino acid 73 to about amino acid 76 and protein kinase C phosphorylation sites . 
from about amino acid 70 to about amino acid 72 and from about amino acid 76 to about amino acid 78. Clone 
DNA59205-1421 has been deposited widi ATCC on June 23, 1998 and is assigned ATCC deposit no. 203009. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignmem analysis of die fulI-lengUi sequence shown in Figure 178 (SEQ ID NO:260), evidenced significant 
homology between die PR0812 amino acid sequence and the following Dayhoff sequences: P_W35802. 
P_W35803. PSC1_RAT. S68231, GENI3917. PSC2_RAT. CC 10_HUMAN.UTER.RABIT. AF008595_1 and 
A56413. 

EXAMPLE 78 : Isolation of cD NA clones Encoding Human PRO1069 

Use of die signal sequence algoridun described in Example 3 above aUowed identification of a single 
Incyte EST sequence tfcsignated herein as 100727. This sequence was dien compared to a prq)rietary EST 
DNA database (UFESEQ™, Incyte Pharmaceuticals. Palo Alto, CA) to identify existing homologies. The 
homology search was performed using die computer program BLAST or BLAST2 (Altshul et al.. Metfiods jn 
FtiT^oloev 266:460-480 (19%)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater dut did not encode known proteins were clustered and assembled into a consensus DNA sequence witii 
die program "phrap" (Phil Green. Univ. of Washington. SeaUlc, Washington). The consensus sequence obtained 
therefrom is herein designated DNAS6001. 

In light of an observed sequence homology between die DNA56001 consensus sequence and an EST 
sequence encompassed widiin die Incyte EST clone no. 3533881 . die Incyte EST clone 3533881 was purchased 
and die cDNA insert was obtained and sequenced. It was found diat diis insert encoded a full-lengdi protein. 
The sequence of diis cDNA insert is shown in Figure 179 and is die full-lengdi DNA sequence for PRO1069. 
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Clone DNA592n-1450 was deposited wi* the ATCC on June 9. 1998. and is assigned ATCC deposit no. 
209960. 

The entire nucleotide sequence of DNA5921 1-1450 is shown in Figure 179 (SEQ ID NO:261). Clone 
DNA5921 1-1450 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 197-199 and ending at the stop codon at nucleotide positions 464-466. The predicted polypeptide 
5 precursor is 89 amino acids long (Figure 180). The full-length PRO1069 protein shown in Figure 180 has an 
estimated molecular weight of about 9,433 daltons and a pi of about 8.21 . Analysis of the fiiU-length PRO1069 
sequence shown in Figure 180 (SEQ ID NO:262) evidences die presence of the following features: a signal 
peptide sequence at amino acid 1 to about 16; a transmembrane domain at about amino acids 36 to about 59; 
potential N-myristoylation sites at about amino acids 41^. 45-50. and 84-89; and homology widi extracellular 
10 proteins SCP/Tpx-l/Ag5/PR-l/Sc7 at about amino acids 54 to about 66. 

Analysis of die amino acid sequence of the fWl-length PRO1069 polypeptide suggests that it possesses 
significant sequence similarity to CHIF. dtereby indicating Uiat PRO1069 may be a member of die CHIF family 
of polypeptides. More panicularly, analysis of die amino acid sequence of die full-lengdi PRO1069 polypeptide 
using die Dayhoff database (version 35 .45 SwissProt 35) evidenced homology between die PRO1069 amino acid 
15 sequence and die following. Dayhoff sequences: CHIF_RAT. A55571, PLM_HUMAN. A40533. 
ATNG_BOVIN. RIC_MOUSE. PETD_SYNY3. VTB1_XENLA. A05009. and S75086. 

Clone DNA5921 1-1450 was deposited witti die ATCC on June 9, 1998. and is assigned ATCC deposit 
no. 209960. 

20 EXAMPLE 79: Isolation of cDNA Clones Encoding Human PROl 129 

Use of die signal sequence algoridun described in Example 3 above allowed identification of a single 
Incyte EST cluster sequence designated herein as 98833. The Incyte EST cluster sequence no. 98833 sequence 
was Uien compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g.. GcnBank) and a proprietary EST DNA database (LIFESEQ^". Incyte PharmaceuticaU, Palo Alto. CA) 
25 to identify existing homologies. The homology scan* was performed using die computer program BLAST or 
BLAST2 (Altshul et al.. MP.hnri. in Enzymoloev 266:460^80 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater diat did not encode known proteins were clustered and 
assembled into a consensus DNA sequence widi die program "phrap" (Phil Green. University of Washington. 
Seatde. Washington). The consensus sequence obtained dierefrom is herein designated DNA56038. 
30 In light of an observed sequence homology between die DNA56038 consensus sequence and an EST 

sequence encompassed widiin die Incyte EST clone no. 1335241. die Incyte EST clone 1335241 was purchased 
and die cDNA insen was obtained and sequenced. It was found diat dus insert encoded a full-lcngdi protein. 
The sequence of diis cDNA insert is shown in Figure 181 and is herein designated DNA59213-1487. 

The fiill lengdi clone diown in Figure 181 contained a single open reading frame widi an apparent 
35 translational initiation site at nucleotide positions 42-44 and ending at die stop codon found at nucleotide positions 
1614-1616 (Figure 181; SEQ ID NO:263). The predicted polypeptide precursor is 524 amino acids long, has 
a calculated molecular weight of approximately 60,310 daltons and an estimated pi of approximately 7.46. 
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Analysis of ihe ftill-length PROl 129 sequence shown in Figure 182 (SEQ ID NO:264) evidences the presence 
of the following: type II transmembrane domains from about amino acid 13 to about amino acid 32 and from 
about amino acid 77 to about amino acid 102, a cytochrome P-450 cysteine heme-iron ligand signanire sequence 
from about amino acid 461 to alK)ut amino acid 470 and potential N-glycosyiation sites from about amino acid 
112 to about amino acid 115 and from about amino acid 168 to about amino acid 171. Clone DNA59213-1487 
5 has been deposited with the ATCC on June 9, 1998 and is assigned ATCC deposit no. 209959. 

Analysis of the amino acid sequence of the full-length PROl 129 polypeptide suggests that it possesses 
sequoice similarity to the cytochrome P-450 family of proteins. More specifically, an analysis of the Dayhoff 
database (version 35.45 SwissProt 35) evidenced some degree of homology between the PROl 129 amino acid 
sequence and the following Dayhoff sequences, AC004S23_1, S45702, AF054821_1 and 153015. 

10 

EXAMPLE 80 : Isolation of cDNA clones Encoding Human PRO1068 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the LIFESEQ® database, designated Incyte cluster no. 141736. This EST cluster sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 

15 (e.g., GenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to 
identify existing homologies. One or more of the ESTs was derived from a human mast cell line from a patient 
with mast cell leukemia. The homology search was performed using the computer program BLAST or BLAST2 
(Altshul et al.. Methods in Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled into a 

20 consensus DNA sequence with the program "phrap" (Phil Green. University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA56094. 

In light of an observed sequence homology between the DNA56094 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 004974, the Incyte EST clone 004974 was purchased 
and the cDNA insen was obtained and sequenced. It was found that this insen encoded a full-length protein. 

25 The sequence of this cDNA insert is shown in Figure 183 and is herein designated as DNA59214-1449 (SEQ 
ID NO:265). 

The full length clone shown in Figure 183 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 42-44 and ending at the stop codon fotmd at nucleotide positions 
414-416 (Figure 183; SEQ ID NO:265). The predicted polypeptide precursor (Figure 184, SEQ ID NO:266) 

30 is 124 amino acids long. PRO1068 has a calculated molecular weight of approximately 14,284 daltons and an 
estimated pi of approximately 8.14. The PRO1068 polypeptide has the following additional feanires. as 
indicated in Figure 184: a signal peptide sequence at about amino acids 1-20, a urotensin II signamre sequence 
at about amino acids 118-123, a cell attachment sequence at about amino acids 64-66, and a potential cAMP- 
and cGMP-dependem protein kinase phosphorylation site at about amino acids 1 12-1 15. 

35 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 184 (SEQ ID NO:266). revealed homology 
between the PRO1068 amino acid sequence and the following Dayhoff sequences: HALB0P_1, MTV043_36, 
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150498, and P_R78445 

Clone DNA59214-1449 was deposited with the ATCC on July 1, 1998 and is assigned ATCC deposit 
no.203046. 

EXAMPLE 81 : Isolation of cDNA clones Encoding Human PRO1066 
5 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

Incyte EST cluster sequence designated herein as 79066. The Incyte EST cluster sequence no. 79066 sequence 
was then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g.. GenBank) and a proprietary EST DNA database (LIFESEQ^*^, Incyte Pharmaceuticals, Palo Alto, CA) 
to identify existing homologies. The homology search was performed using the computer program BLAST or 

10 BLAST2 (Altshul et al.. Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
assembled into a consensus DNA sequentfc with the program *'phrap" (Phil Green, University of Washington, 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56121. 

In light of an observed sequence homology between the DNA56121 consensus sequence and an EST 

15 sequence encompassed within the Incyte EST clone no. 1515315, the Incyte EST clone 1515315 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a fuil-lengih protein. 
The sequence of this cDNA insert is shown in Figure 185 and is herein designated DNA59215-1425. 

The full length clone shown in Figure 185 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 176-178 and ending at the stop codon found at nucleotide 

20 positions 527-529 (Figure 185; SEQ ID NO:267). The predicted polypeptide precursor is 1 17 amino acids long, 
has a calculated molecular weight of approximately 12,91 1 dalions and an estimated pi of approximately 5.46. 
Analysis of the ftill-length PRO 1066 sequence shown in Figure 186 (SEQ ID NO:268) evidences the presence 
of the following: a signal peptide from about amino acid 1 to about amino acid 23. a cAMP- and cGMP- 
dependent protein kinase phosphorylation site from about amino acid 38 to about amino acid 41 and potential 

25 N-myristoylation sites from about amino acid 5 to about amino acid 10, from about amino acid 63 to about amino 
- acid 68 and from about amino acid 83 to about amino acid 88. Clone UNQ524 (DNA59215-1425) has been 
deposited with the ATCC on June 9, 1998 and is assigned ATCC deposit no. 209961. 

Analysis of the amino acid sequence of the full-length PRO 1066 polypeptide suggests that it does not 
possess significant sequence similarity to any known human protein. However, an analysis of the Dayhoff 

30 database (version 35.45 SwissProt 35) evidenced some degree of homology between the PRO1066 amino acid 
sequence and the following Dayhoff sequences, MOTI.HUMAN, AF025667_1, MTCY19H9_8 and 
RABIGKCH^l. 

EXAMPLE 82: Isolation of cDNA Clones Encodin g Human PRO 1 184 
35 Use of the signal sequence algorithm described in Example 3 on ESTs from an Incyte database allowed 

identification a candidate sequence designated herein as DNA56375. This sequence was then compared to a 
variay of expressed sequence tag (EST) databases which included public EST databases (e.g., CknBank) and 
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a proprietary EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
ct al., Methods in Enzvmology 266:46(M80 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consenstis 
DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The 
5 consensus sequence obtained therefrom is herein designated DNA56375. 

In light of an observed sequence homology between the DNA56375 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 1428374, the Incyte EST clone 1428374 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a fiilMength protein. 
The sequence of this cDNA insert is shown in Figure 187. 

10 The full length clone shown in Figure 187 contained a single open readmg frame with an apparent 

translattonal initiation site at nucleotide positions 106-108 and ending at the stop codon found at nucleotide 
positions 532-534 (Figure 187; SEQ ID NO:269). The predicted polypeptide precursor is 142 amino acids long, 
has a calculated molecular weight of approximately 15,690 daltons and an estimated pi of approximately 9.64. 
Analysis of the full-length PROl 184 sequence shown in Figure 188 (SEQ ID NO:270) evidences the presence 

IS of a signal peptide at about amino acids 1-38. Clone DNA59220-1514 has been deposited with the ATCC on 
June 9, 1998. It is understood that the deposited clone has the actual sequences and that representations are 
presented herein. 

Analysis of the amino acid sequence of the full-length PROl 184 polypeptide suggests that it possesses 
some sequence identity with a protein called TIM from Drosophila virilis, designated '*DVTIMS02_r in the 
20 Dayhoff data base, (version 35.45 SwissProt 35). Other 

Dayhoff database (version 35.45 SwissProt 35) sequences having some degree of sequence identity with 
PR01184 include: WISl^SCHPO, F002186_1, ATAC00239124 and MSAIPRP_1. 

EXAMPLE 83 : Isolation of cDNA clones Encoding Human PRO 1360 

25 Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

sequence from an Incyte database, designated DNA10572. This EST sequence was then compared to a variety 
of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank, Merck/Wash. 
U.) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify 
existing homologies. The homology search was performed using the computer program BLAST or BLAST2 

30 (Altshul ci al. , Methods in Enzvmoloev 266:460480 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and assembled iiuo a 
consensus DNA sequence with the program "phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA57314. 

In light of an observed sequence homology, between the DNA57314 consensus sequence and an EST 

35 sequence encompassed within the Merck EST clone no. AA406443, the Merck EST clone AA406443 was 
purchased and the cDNA insert was obtained and sequenced. It was foimd that this insen encoded a full-length 
protein. The sequence of this cDNA insen is shown m Figure 189 and is herein designated as DNA59488-1603. 
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The fiill length clone shown in Figure 189 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 54-56 and ending at the stop codon found at nucleotide positions 
909-911 (Figure 189; SEQ ID NO:27l). The predicted polypeptide precursor (Figure 190. SEQ ID NO:272) 
is 285 amino acids long. PRO 1360 has a calculated molecular weight of approximately 31 ,433 daltons and an 
estimated pi of approximately 7.32. Clone DNA59488-1603 was deposited with the ATCC on August 25, 1998 
5 . and is assigned ATCC deposit no. 203157. 

An analysis of the Dayhoff database (version 35.45 SwissProi 35), using a WU-BLAST2 sequence 
aligmnem analysis of the full-length sequence shown in Figure 190 (SEQ ID NO:272). revealed sequence identity 
between the PROi360 amino acid sequence and the following Dayhoff sequences: UN51_CAE£L, 
YD4B^SCHP0, AF000634^1. GFO^ZYMMO. YEIJ^SCHPO. D86566_l, ZMGFO.l, S76976, 
10 PPSA^SYNY3, and CEF28B1_4. 

EXAMPLE 84: Isolation of cDNA clones Encoding Human PRO1029 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 18763. This EST cluster sequence was then compared 

15 to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals. Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
et al.. Methods in Enzvmoiogv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 

20 DNA sequence with the program *'phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obuined therefrom is herein designated DNA57854. 

In light of an observed sequence homology between the DNA57854 consensus sequence and an EST 
sequence encompassed withm the Merck EST clone no. T98880. the Merck EST clone T98880 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a fiill-length protein. 

25 The sequence of diis cDNA insen is shown m Figure 191 and is herein designated as DNA59493-1420. 

Clone DNA59493-1420 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 39-4 1 and ending at the stop codon at micleotide positions 297-299 (Figure 191). The 
predicted polypeptide precursor is 86 amino acids long (Figure 192). The fiill-length PRO1029 protein shown 
in Figure 192 has an estimated molecular weight of about 9,548 daltons and a pi of about 8.52. Analysis of the 

30 full-length PROI029 sequence shown in Figure 192 (SEQ ID NO:274) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 19, an amino acid block having homology to 
bacterial rtiodopsins retinal binding site protein from about amino acid 50 to about amino acid 61 , a prenyl group 
binding site from about amino acid 83 to about amino acid 86 and a potential N-glycosylation site from about 
amino acid 45 to about amino acid 48. Clone DNA59493-1420 has been deposited with ATCC on July 1 , 1998 

35 and is assigned ATCC deposit no. 203050, 

An analysis of the Dayhoff database (version 35.45 SwissProi 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 192 (SEQ ID NO:274), evidenced significant 
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homology between the PROI029 amino acid sequence and the following Dayhoff sequences: S66088« 
AF031815^i, MM4A6L^1, PSEIS52a-l. S17699 and P^R63635. 

EXAMPLE 85 : Isolation of cDNA clones Encoding Human PROl 139 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
5 cluster sequence from the Incyte database, designated 4461 . This EST cluster sequence was then compared to 
a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and 
a proprietary EST DNA database (UFESEQ®, Incyte Pharmaceuticals. Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 
ct al.. Methods in Enzvmoloev 266:460-480 (1996)), Those comparisons resulting in a BLAST score of 70 (or 

10 in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program ''phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA57312. 

The DNA57312 consensus sequence included a 172 nucleotides long public EST (T62095, 
Merck/University of Washington public database). This EST clone, identified herein as a putative protein coding 

15 sequence, was purchased from Merck, and sequenced to provide the coding sequence of PROl 139 (Figure 193). 
As noted before, the deduced amino acid sequence of DNA59497- 1496 shows a significant sequence identity with 
the deduced amino acid sequence of HS0BRGRP_1. The full-length protein (Figure 194) contains a putative 
signal peptide between amino acid residues 1 and about 28. and three putative transmembrane domains 
(approximate amino acid residues 33-52, 71-89, 98-120). 

20 

EXAMPLE 86: Isolation of cDNA clones Encoding Human PRO 1309 

An expressed sequence tag (EST) DNA database (LIFESEQ^. Incyte Pharmaceuticals, Palo Alto, CA) 
was searched and an EST was identified which showed homology to SLIT. 

RNA for constroiction of cDNA libraries was isolated from human fetall brain tissue. The cDNA 
25 libraries used to isolate the cDN A clones encoding human PRO 1 309 were constructed by standard methods using 
conunercially available reagents such as those from Invitrogen, San Diego, CA. The cDNA was primed with 
oligo dT containing a NotI site, linked with blunt to Sail hemikinased adaptors, cleaved with NotI, sized 
appropriately by gel electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as 
pRKB or pRKD; pRKSB is a precursor of pRKSD that does not contain the Sfil site; see. Holmes et al.. Science . 
30 253: 1278-1280 (1991)) in the unique Xhol and NotI. 

The cDNA libraries (prepared as described above), were screened by hybridization with a synthetic 
oligonucleotide probe derived from the above described Incyte EST sequence: 

5*-TCCGTGCAGGGGGACGCCmTCAGAAACTGCGCCGAGrrAAGGAAC-3' (SEQ ID NO:279). 

A cDNA clone was isolated and sequenced in entirety. The entire nucleotide sequence of DNA59588- 
35 1571 is shown in Figure 195 (SEQ ID NO:277). Clone DNA59588-1571 contains a single open reading frame 
with an apparent translational initiation site at nucleotide positions 720-722 and a stop codon at nucleotide 
positions 2286-2288 (Figure 195; SEQ ID NO:277). The predicted polypeptide precursor is 522 amino acids 
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long. The signal peptide is approximately at 1-34 and the transmembrane domain is at approximately 428-450 
of SEQ ID NO:278. Clone DNA59588-1571 has been deposited with ATCC and is assigned ATCC deposit no. 
203106. The fiill-iength PRO1309 protein shown in Figure 196 has an estimated molecular weight of about 
58,614 daltons and a pi of about 7.42. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35). using a WU-BLAST2 sequence 
5 alignment analysis of the full-length sequence shown in Figure 196 (SEQ ID NO:278). revealed sequence identity 
between the PRO1309 amino acid sequence and the following Dayhoff sequences: AB007876_1, GPV MOUSE, 
ALS^RAT. P_R85889, LUM^CHICK, AB014462^1, PGSLCANFA. CEM88J, A58532 and GEN11209. 

EXAMPLE 87: Isolation of cDNA Clones Encoding Human PRO 1028 

10 Use of the signal sequence algorithm described in Example 3 above allowed identiOcaiion of a cenain 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (UFESEQ^, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 

15 Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
pr greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program **phrap" (Phil Green, University of Washington, Seattie, Washington). The consensus sequence 
obtained therefrom is herein designated DNA59603. 

In light of an observed sequence homology between the DNA59603 sequence and an EST sequence 

20 contained within Incyte EST clone no. 1497725, the Incyte EST clone no. 1497725 was purchased and die cDNA 
insert was obtained and sequenced. It was found that the insert encoded a full-length protein. The sequence of 
this cDNA insert is shown in Figure 197 and is herein designated as DNA59603-1419. 

The entire nucleotide sequence of DNA59603-1419 is shown in Figure 197 (SEQ ID NO:280). Clone 
DNA59603-1419 contains a single open reading, frame with an apparem translational initiation site at nucleotide 

25 positions 21-23 and ending at the stop codon at nucleotide positions 612-614 (Figure 197). The predicted 
polypeptide precursor is 197 anuno acids long (Figure 198). The full-length PRO1028 protein shown in Figure 
198 has an estimated molecular weight of about 20,832 daltons and a pi of about 8.74. Clone DNA59603-1419 
has been deposited with the ATCC. Regarding the sequence, it is understood that die deposited clone contains 
the correct sequence, and die sequences provided herein are based on known sequencing techniques. 

30 Analyzing the amino acid sequence of SEQ ID NO:28I, the putative signal peptide is at about amino 

acids 1-19 Of SEQ ID NO:281. An N-glycosylation site is at about amino acids 35-38 of SEQ ID NO:281. A 
C-type lectin domain is at about amino acids 108- 1 1 7 of SEQ ID NO:28 1 , indicating that PROS 1 3 may be related 
to or be a lectin. The corresponding nucleotides of these anuno acid sequences or others can be routinely 
determined given the sequences provided herein. 

35 
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EXAMPLE 88: Isolation of cDNA Clones Encoding Human PRO 1027 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a certain 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto» CA) to identify existing homologies. The 
5 homology search was performed using die computer program BLAST or BLAST2 (Altshul et ai.. Methods in 
Enzvmology 266:460-480 (19%)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were chistered and assembled into a consensus DNA sequence with 
the program **phrap'* (Phil Green, University of Washington. Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56399. 

10 In light of an observed sequence homology between the DNA56399 sequence and an EST sequence 

contained within Incyte EST clone no. 937605, the Incyte EST clone no. 93760S was purchased and the cDNA 
insert was obtained and sequenced. It was found that the insen encoded a full-length protein. The sequence of 
this cDNA insen is shown in Figure 199 and is herein designated as DNA59605-1418. 

The entire nucleotide sequence of DNA59605-14I8 is shown in Figure 199 (SEQ ID NO:282). Clone 

IS DNA59605-14i8 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 31-33 and ending at the stop codon at nucleotide positions 262-264 (Figure 199). The predicted 
polypeptide precursor is 77 amino acids long (Figure 200). The full-length PRO1027 protein shown in Figure 
200 has an estimated molecular weight of about 8,772 daltons and a pi of about 9.62, Clone DNA59605-1418 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone conuins 

20 the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO:283, the putative signal peptide is at about amino 
acids 1-33 of SEQ ID NO:283. The type II fibronectin collagen-binding domain begins at about amino acid 30 
of SEQ ID NO:283. The corresponding nucleotides for these amino acid sequences and others can be routinely 
determined given the sequences provided herein. PRO1027 may be involved in tissue formation or repair. 

25 The following Dayhoff designations appear to have some sequence identity with PRO 1027: 

SFT2^YEAST; ATM3E9_2: A69826; YM16_MARP0; E64896; U60193_2; MTLRAJ205_1; MCU60315_70; 
SPAS^SHIFL; and S54213. 

EXAMPLE 89: Isolation of cDNA Clones Encoding Human PROl 107 

30 Use of the signal sequence algorithm described m Example 3 above allowed identification of a certain 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (UFESEQ^, Incyte Pharmaceuticals. Palo Alto, CA) to identiiy existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 

35 Enzvmologv 266:46(M80 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Riil Green. University of Washington. Seattle. Washington). The consensus sequence 
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obtained there^om is herein designated DNA56402. 

In light of an observed sequence homology between the DNA56402 sequence and an EST sequence 
contained within Incyte EST clone no. 3203694, the Incyte EST clone no. 3203694 was purchased and the cDNA 
insert was obtained and sequenced. It was found that the insert encoded a full-length protein. The sequence of 
this cDNA insen is shown in Figure 201 and is herein designated as DNA59606-1471. 
5 The entire nucleotide sequence of DNA59606-1471 is shown in Figure 201 (SEQ ID NO:284). Clone 

DNA59606-1471 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 244-246 and ending at the stop codon at nucleotide positions 1675-1677 of SEQ ID NO:284 (Figure 
201). The predicted polypeptide precursor is 477 amino acids long (Figure 202). The fiilMength PR01107 
protein shown m Figure 202 has an estimated molecular weight of about 54,668 daltons and a pi of about 6.33. 
10 Clone DNA59606-1471 has been deposited with ATCC on June 9. 1998. It is understood that the deposited 
clone has the acmal nucleic acid sequence and that the sequences provided herein are based on known 
sequencing techniques. 

Analysis of the amino acid sequence of the full-length PROl 107 polypeptide suggests that it possesses 
significant sequence similarity to phosphodiesterase I/nucleotide phyrophosphaiase, human insulin receptor 
15 tyrosine kinase inhibitor, alkaline phosphodiesterase and auioiaxin, thereby indicating that PROl 107 may have 
at least one or all of the activities of these proteins, and that PROl 107 is a novel phosphodiesterase. More 
specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced sequence identity 
between the PROl 107 amino acid sequence and at least the following Dayhoff sequences: AF005632_1, 
P379148, RNU78787_1, AF060218_4, A57080 and HUMATXT^l. 

20 

EXAMPLE 90: Isolation of cDNA clones Encoding Human PRO 11 40 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
Incyte EST sequence, Incyte cluster sequence No. 135917. This sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 

25 EST DNA database (LIFESEQ^", Incyte Pharmaceuticals. Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, Univ. of Washington, Seattle, Washington). The consensus sequence obtained 

30 therefrom is herein designated DNA56416. 

In light of an observed sequence homology between DN A564 16 and an EST sequence contained within 
Incyte EST clone no. 3345705, Incyte EST clone no. 3345705 was obtained and its insert sequenced. It was 
found that the insert encoded a full-length protein The sequence, designated herein as DNA59607-1497, which 
is shown in Figure 203. is the full-length DNA sequence for PROl 140. Clone DNA59607-I497 was deposited 

35 widi the ATCC on June 9, 1998, and is assigned ATCC deposit no. 209946. 

The entire nucleotide sequence of DNA59607-1497 is shown in Figure 203 (SEQ ID NO:286). Clone 
DNA59607-1497 contains a single open reading frame with an apparent translational initiation site at nucleotide 
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positions 210-212 and ending at the stop codon at nucleotide positions 975-977 (Figure 203). The predicted 
polypeptide precursor is 255 amino acids long (Figure 204). The ftill-length PROl 140 protein shown in Figure 
204 has an estimated molecular weight of about 29.405 daltons and a pi of about 7.64. Analysis of the full- 
length PROl 140 sequence shown in Figure 204 (SEQ ID NO:287) evidences the presence of three 
transmembrane domains at about amino acids 101 to 118, 141 to 161 and 172 to 191. 

Analysis of the amino acid sequence of the fiUl-lcngth PRO 1 140 polypeptide using the Dayhoff database 
(version 35.45 SwissProt 35) evidenced homology between the PROl 140 amino acid sequence and the following 
Dayhoff sequences: AF023602_1. AF000368_1. CIN3_RAT. AF003373_1. GEN13279, and AF003372_1. 

Clone DNA59607-1497 was deposited with the ATCC on June 9. 1998, and is assigned ATCC deposit 
no. 209946. 



10 



FX AMPLE 91: Isolation of cDNA clones Encoding Hum an PROl 106 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
tocyte EST sequence. This sequence was then compared to a variety of expressed sequence tag (EST) databases 
which included public EST databases (e.g.. GenBank) and a proprietary EST DNA database (LIFESEQ™, Incyte 

15 Riarmaceuacals, Palo Alto. CA) to identify existing homologies. The homology search was performed using 
thecomputerprogramBLASTorBLAST2(Altshuletal.. MPthnrl.; in Figvmologv 266:460-480 (1996)). Those 
comparisons resulting in a BLAST score of 70 (or in some cases 90) or greater that did not encode known 
proteins were clustered and assembled into a consensus DNA sequence with the program "phrap" (Phil Green, 
Univ. of Washington. Seattie, Washington). The consensus sequence obtained therefrom is herein designated 

20 DNA56423. 

In light of an observed sequence homology between DNA56423 and an EST sequence contained within 
Incyte EST clone no. 171 1247, Incyte EST clone no. 171 1247 was obtained and its insert sequenced. It was 
found that the insert encoded a full-length protein The sequence, designated herein as DNA59609-1470. vrtuch 
is shown in Figure 205. is the fuU-lengUi DNA sequence for PROl 106. Clone DNA59609-1470 was deposited 
25 with the ATCC on June 9, 1998. and is assigned ATCC deposit no. 209963. 

The entire nucleotide sequence of DNA59609-1470 is shown in Figure 205 (SEQ ID NO:288). Clone 
DNA59609-1470 contains a single open reading frame wiUt an apparent uranslational initiation site at nucleotide 
positions 61-63 and ending at the stop codon at nucleotide positions 1468-1470 of SEQ ID NO:288 (Figure 205). 
The predicted polypeptide precursor is 469 amino acids long (Figure 206). The fiill-lengdi PROn06 protein 
30 shown in Figure 206 has an estimated molecular weight of about 52.689 daltons and a pl of about 8.68. It is 
understood that the skilled artisan can construct the polypeptide or nucleic acid encoding therefor to exclude any 
one or more of all of these domains. For example, the transmembrane domain region(s) and/or either of the 
amino terminal or cartwxyl end can be excluded. Gone DNA59609-1470 has been deposited with ATCC on 
June 9, 1998. It is understood that die deposited clone has the acnial nucleic acid sequence and that the 
35 sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the full-length PROl 106 polypeptide suggests that it possesses 
significam sequence similarity to the peroxisomal ca-dependera solute carrier, thereby indicating that PROl 106 
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may be a novel transporter. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 
35) evidenced sequence identity l^iween the PRO1106 amino acid sequence and at least the following Dayhoff 
sequences, AF004161_U IG{)02N01_25. GDC^BOVIN and BT1_MAIZE. 

EXAMPLE 92: Isolation of cDNA clones Encoding Human PR01291 
S Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 

cluster sequence from the Incyte database, designated 120480. This EST cluster sequence was then compared 
to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) 
and a proprietary EST DNA database (Lifeseq®, hicyte Pharmaceuticals, Palo Alto, CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLAST2 (Altshul 

10 et al.. Methods in Enzvmologv 266:460-480 (1996)). Those comparisons r^ulting in a BLAST score of 70 (or 
in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program ''phrap" (Phil Green, University of Washington, Seattle, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56425. 

In light of an observed sequence homology between the DNA56425 sequence and an EST sequence 

15 encompassed within the Incyte EST clone no. 2798803, the Incyte EST clone 2798803 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 207 and is herein designated as DNA5961 0-1556. 

Clone DNA59610-1556 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 61-63 and ending at the stop codon at nucleotide positions 907-909 (Figure 207). The 

20 predicted polypeptide precursor is 282 amino acids long (Figure 208). The full-length PR01291 protein shown 
in Figure 208 has an estimated molecular weight of about 30,878 daltons and a pi of about 5.27. Analysis of 
the full-length PR01291 sequence shown in Figure 208 (SEQ ID NO:291) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 28, a transmembrane domain from about 
amino acid 258 to about amino acid 281 and potential N-glycosylation sites from about amino acid 1 12 to about 

25 amino acid 1 15, from about amino acid 160 to about amino acid 163, from about amino acid 190 to about amino 
acid 193, from about amino acid 196 to about amino acid 199, from about amino acid 205 to about amino acid 
208, from about amino acid 216 to about amino acid 219 and from about amino acid 220 to about amino acid 
223. . Clone DNA59610-1556 has been deposited with ATCC on June 16, 1998 and is assigned ATCC deposh 
no. 209990. 

30 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignmem analysis of the full-length sequence shown in Figure 208 (SEQ ID NO:291), evidenced significant 
homology between die PR01291 amino acid sequence and the following Dayhoff sequences: HSU90552_1, 
HSU90 144^1, AF033107_1, HSB73_1, HSU90142^1, GGCD80^1, P_W34452. MOG_MOUSE, B39371 and 
P^R71360. 

35 
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EXAMPLE 93 : Isolation of cDNA clones Encoding Human PRO 1105 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (Lifeseq®, Incyte Pharmaceuticals. Palo Alto, CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled mto a consensus DNA sequence with 
the program '"phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56430. 

10 In light of an observed sequence homology between the DNA56430 sequence and an EST sequence 

encompassed within the Incyte EST clone no. 1853047, the Incyte EST clone 1853047 was purchased and the 
cDNA insen was obtained and sequenced. It was found that this insen encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 209 and is herein designated as DNA59612-1466. 

The entire nucleotide sequence of DNA59612-1466 is shown in Figure 209 (SEQ ID NO:292). Clone 

1 S DN A596 12- 1466 contains a single open reading frame with an apparent trahslational initiation site at nucleotide 
positions 28-30 and ending at the stop codon at nucleotide positions 568-570 of SEQ ID NO:292 (Figure 209). 
The predicted polypeptide precursor is 180 amino acids long (Figure 210). The full-length PROl 105 protein 
shown in Figure 210 has an estimated molecular weight of about 20,040 daltons and a pi of about 8.35. Clone 
DNA59612-1466 has been deposited with the ATCC on June 9. 1998. It is understood that the deposited clone 

20 has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

Analyzing Figure 210, a signal peptide is at about amino acids 1-19 of SEQ ID NO:293 and 
transmembrane domains are shown at about amino acids 80-99 and 145-162 of SEQ ID NO:293. It is 
imderstood that the skilled artisan could form a polypeptide with all of or any combination or individual selection 
25 of these regions. It is also understood that the corresponding nucleic acids can be routinely identified and 
prepared based on the information provided herein. 

EXAMPLE 94: Isolation of cDNA clones Encoding Human PR051 1 

Use of Che signal sequence algoridmi described in Example 3 above allowed identification of an EST 

30 cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , (jenBank) and a proprietary 
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

35 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program '^phrap'* (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56434. 
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In light of an observed sequence homology between the DN A56434 sequence and an EST sequence 
encompassed within the Incyte EST clone no, 1227491. the Incyie EST clone 1227491 was purchased and the 
cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. The 
sequence of this cDNA insert is shown in Figure 21 1 and is herein designated as DNA59613-1417. 

The entire nucleotide sequence of DNA59613-14I7 is shown in Figure 211 (SEQ ID NO:294). Clone 
5 DNA59613'14I7 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 233-235 and ending at the stop codon at nucleotide positions 944-946 (Figure 21 1). The predicted 
polypeptide precursor is 237 amino acids long (Figure 212). The full-length PROS 11 protein shown in Figure 
212 has an estimated molecular weight of about 25,284 daltons and a pi of about 5.74. Clone DNA59613-1417 
has been deposited with the ATCC. Regarding the sequence, it is understood that the deposited clone contains 
10 the correct sequence, and the sequences provided herein are based on known sequencing techniques. 

Analyzing the amino acid sequence of SEQ ID NO:295, the putative signal peptide is at about amino 
acids 1-25 of SEQ ID NO:295. The N-glycosylation sites are at about amino acids 45-48, 73-76, 107-110, 118- 
121, 132-135, 172-175, 175-178 and 185-188 of SEQ ID NO:295. An arthropod defensins conserved region 
is at about amino acids 176-182 of SEQ ID NO:295. A kringle domain begins at about amino acid 128 of SEQ 
15 ID NO:295 and a ly-6/u-PAR domain begins at about amino acid 6 of SEQ ID NO:295. The corresponding 
nucleotides of these amino acid sequences and others can be routinely determined given the sequences provided 
herein. 

The designations appearing in a Dayhoff database with which PROS 1 1 has some sequence identity are 
as foUows: SSC20F10_1; SF041083; P^W26579; S44208; JC2394: PSTA_D1CDI; A27020; S59310: 
20 RAG1_RABIT; and MUSBALBCl.l. 

EXAMPLE 95 : Isolation of cDNA clones Encoding Human PRO! 104 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyie database. This EST cluster sequence was then compared to a variety of 

25 expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a prc^rietary 
EST DNA database (Lifeseq*. Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
Enzvmoloev 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 

30 the program ^'phrap" (Phil Green, University of Washington, Seaitie. Washington). The consensus sequence 
obtained therefrom is herein designated DNA56446. 

In light of an observed sequence homology between the DN A56446 sequence and an EST sequence 
encompassed within the Incyte EST clone no. 2837496, the Incyte EST clone 2837496 was purchased and the 
cDNA insen was obtained and sequenced. It was found that this insert encoded a full-length protem. The 

35 sequence of this cDNA insen is shown in Figure 213 and is herein designated as DNA59616-1465. 

. The entire nucleotide sequence of DNA59616-1465 is shown in Figure 213 (SEQ ID NO:296). Clone 
DNA59616-1465 contains a single open reading frame with an apparent translational initiation site at nucleotide 
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positions 109>1 1 1 and ending at the stop codon at nucleotide positions 1 132-1 134 of SEQ ID NO:296 (Figure 
213). The predicted polypeptide precursor is 341 amino acids long (Figure 214). The full-length PROl 104 
protein shown in Figure 214 has an estimated molecular weight of about 36,769 daltons and a pi of about 9.03. 
Clone DNA59616-1465 has been deposited with ATCC on June 16, 1998. It is imderstood that the deposited 
clone has the acnial nucleic acid sequence and that the sequences provided herein are based on known 
5 sequencing techniques. 

Analyzing Figure 214, a signal peptide is at about amino acids 1-22 of SEQ ID NO:297. N- 
myristoylation sites are at about amino acids 4M6, 110-115, 133-138, 167-172 and 179-184 of SEQ ID 
NO:297. 

10 EXAMPLE 96: Isolation of cDNA clones Encoding Human PROl 100 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (Lifeseq®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

15 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
Enzvmology 266:460-480 ( 1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program *'phrap'* (Phil Green, University of Washington, Seattle, Washington). 

In light of an observed sequence homology between the obtained consensus sequence and an EST 

20 sequence encompassed within the Incyte EST clone no. 2305379. the Incyte EST clone 2305379 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insen is shown in Figure 215 and is herein designated as DNA59619-1464. 

The entire nucleotide sequence of DNA59619-1464 is shown in Figure 215 (SEQ ID NO:298). Clone 
DNA59619-1464 contains a single open reading frame with an apparent translational initiation site at nucleotide 

25 positions 33-35 and ending at the stop codon at nucleotide positions 993-995 of SEQ ID NO:298 (Figure 215). 
The predicted polypeptide precursor is 320 amino acids long (Figure 216). The full-length PROl 100 protein 
shown in Figure 216 has an estimated molecular weight of about 36.475 daltons and a pi of about 7.29. Clone 
DNA59619-1464 has been deposited with ATCC on July 1, 1998. It is understood that the deposited clone has 
the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 

30 techniques. 

Upon analyzing SEQ ID NO:299, the approxunate locations of the signal peptide, the transmembrane 
domains, an N-giycosylation site» an N-myristoylation site, a CUB domain and an amiloride-sensitive sodium 
channel domain are present. It is believed that PROl 100 may function as a channel. The corresponding nucleic 
acids for these amino acids and others can be routinely determined given SEQ ID NO:299.. 

35 
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EXAMPLE 97: Isolation of cDNA clones E ncoding Human PR0836_ 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyie database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g.. GenBank) and a proprietary 
EST DNA database (Lifeseq*. Incyte Pharmaceuticals. Palo Alto, CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul « al.. Methods in 
EnzvmolQgv 266:46(M80 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode knownpn)teins were clustered and assembled intoaconsensus DNA sequence 

the program "phrap" (Phi! Green. University of Washington. SeatUc. Washington). The consensus sequence 
obtained is herein designated DNA564S3. 

10 In Ught of an observed sequence homology between the DNA56453 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 2610075. the Incyte EST clone 2610075 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of diis cDNA msert is shown in Figure 217 and is herein designated as DNA59620-1463. 

The entire nucleotide sequence of DNA5%20-1463 is shown in Figure 217 (SEQ ID NO:300). Clone 

15 DNA59620-1463 contains a single open reading frame with an apparent iranslational initiation site at nucleotide 
positions 65-67 and ending at the stop codon at nucleotide positions 1448- 1450 of SEQ ID NO:300 (Figure 217). 
The predicted polypeptide precursor is 461 amino acids long (Figure 218). The full-length PR0836 protein 
shown in Figure 218 has an estimated molecular weight of about 52,085 daltons and a pi of about 5.36. Analysis 
of the fiiU-length PR0836 sequence shown in Figure 218 (SEQ ID NO:301) evidences the presence of the 

20 following: a signal peptide. N-glycosylation sites. N-myristoylation sites, a domain conserved in the 
YJL126wA'LR351c/yhcX family of proteins, and a region having sequence identity with SLSl. Clone 
DNA59620-1463 has been deposited with ATCC on June 16. 1998. It is understood that the deposited clone 
has the acnial nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

25 Analysis of the amino acid sequence of the full-length PR0836 polypeptide suggests that it possesses 

some sequence simUarity to SLSl. thereby indicating that PR0836 may be involved in protein translocation of 
the ER. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) evidenced some 
homology between the PR0836 amino acid sequence and at least the following Dayhoff sequences. S58132. 
SPBC3B9_1. S66714, CRU40057_1 and IMA_CAEEL. 

30 

EXAMPLE 98: Isolation of cDNA clones Encoding Hu man PR01141 

Use of the signal sequence algorithm described in Example 3 above allowed identification of an EST 
cluster sequence from the Incyte database, designated 1 1873. This EST cluster sequence was then compared 
to a variety of expressed sequence ug (EST) databases which included public EST daubascs (e.g.. GenBank) 
35 and a proprietary EST DNA database (LIFESEQ*. Incyte Pharmaceuticals. Palo Alto. CA) to identify existing 
homologies. The homology search was performed using the computer program BLAST or BLASH (Altshul 
ct al.. Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or 
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in some cases 90) or greater that did not encode known proteins were clustered and assembled into a consensus 
DNA sequence with the program •*phrap" (Phil Green, University of Washington, Seanie, Washington). The 
consensus sequence obtained therefrom is herein designated DNA56518. 

In light of an observed sequence homology between the DNA56518 consensus sequence and an EST 
sequence encompassed within the Incyie EST clone no. 2679995, the hicyte EST clone 2679995 was purchased 

5 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insen is shown in Figure 219 and is herein designated as DNA59625-1498. 

Clone DNAS9625-1498 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 204-206 and ending at the stop codon at nucleotide positions 945-947 (Figure 219). 
The predicted polypeptide precursor is 247 amino acids long (Figure 220). The full-length PROl 141 protein 

10 shown in Figure 220 has an estimated molecular weight of about 26,840 daltons and a pi of about 8. 19. Analysis 
of the full-length PROl 141 sequence shown in Figure 220 (SEQ ID NO:303) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 19 and transmembrane domains from 
about amino acid 38 to about amino acid 57, firom about amino acid 67 to about amino acid 83, from about 
amino acid 117 to about amino acid 139 and from about amino acid 153 to about amino acid 170. Clone 

15 DNA59625-1498 has been deposited with ATCC on June 16, 1998 and is assigned ATCC deposit no. 209992. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the fiill-length sequence shown in Figure 220 (SEQ ID NO: 303), evidenced significant 
homology between the PROl 141 amino acid sequence and the following Dayhoff sequences: CEVF36H2L_2, 
PCRB7PRJ_1, AB000506_1. LEU95008_1, MRU87980_15. YIGM_ECOLI, STU65700_1, GHU62778_1, 

20 CYST^SYNY3 and AF009567_1 . 

EXAMPLE 99 : Isolation of cDNA clones Encoding Human PROl 132 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. This consensus sequence is designated herein as DNA35934. Based on the DNA35934 
25 consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the fiill-length coding sequence for 
PROl 132. 

PCR primen (forward and reverse) were synthesized: 
forward PCR primer : 5'-TC<rrGTGACCACCCCTCTAACACC.3* (SEQ ID NO:310) and 
30 reverse PCR primer : 5'-CTGGAACATCTGCTGCCCAGATTC-3' (SEQ ID N0:31 1). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
sequence which had the following nucleotide sequence: 

5 .GTCGGATGACAGCAGCAGCCGCATCATCAATCK3ATCCGACTGCGATATGC-3' (SEQ ID NO:312). 
In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
35 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROl 132 gene using the probe oligonucleotide and one of the PCR primers. RNA 
for construction of the cDNA libraries was isolated from human fetal kidney. 
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DNA sequencing of the clones isolaied as described above gave the fuU-lengih DNA sequence for 
PR01132 and the derived proiein sequence for PROl 132. 

The entire nucleotide sequence of PROl 132 is shown in Figure 225 (SEQ ID NO:308). Clone 
DNA59767-1489 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 354-356 and a stop codon at nucleotide positions 1233-1235 (Figure 225; SEQ ID NO:308). The 
5 prediaed polypeptide precursor is 293 amino acids long. The signal peptide is at about amino acids 1-22 and 
tiie histidine active site is at about amino acids 104-109 of SEQ ID NO:309. Clone DNA59767-1489 has been 
deposited with ATCC (having the acmal sequence rather than representations based on sequencing techniques 
as presented herein) and is assigned ATCC deposit no. 203108. The full-length PROl 132 protein shown in 
Figure 226 has an estimated molecular weight of about 32,020 daltons and a pi of about 8.7. 
10 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 226 (SEQ ID NO:309), revealed sequence identity 
between the PROl 132 amino acid sequence and the following Dayhoff sequences: SSU76256_1, P_W10694, 
MMAE000663_6. AF013988J, U66061J, MMAE000665_2, MMAE00066415, MMAE00066414, 
MMAE000665_4 and MMAE00066412. 

15 

EXAMPLE 100 : Isolation of cDNA clones Encoding Human NL7 (PRO 1346) 

A single EST sequence (#1398422) was found in the LIFESEQ* database as described in Example I 
above. This EST sequence was renamed as DNA45668. Based on the DNA45668 sequence, oligonucleotides 
were synthesized: 1) to identify by PCR a cDNA library that contained the sequence of interest, and 2) for use 
20 as probes to isolate a clone of the full-length coding sequence for NL7. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer : 5 -CACACGTCCAACCTCAAT(3GGCAG-3' (SEQ ID NO:315) 
reverse PCR primer : 5 -GACCAGCAGGGCCAAGGACAAGG.3' (SEQ ID N0:316) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
25 DNA45668 sequence which haid the following nucleotide sequence: 
hvbridization probe: 

5-GTTCTCTGAGATGAAGATCCGGCCGGTCCGGGAGTACCGCTTAG-3' 
(SEQIDNO:317) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
30 screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
isolate clones encoding the NL7 gene using the probe oligonucleotide and one of the PCR primers. RNA for 
construction of the cDNA libraries was isolated from a human fetal kidney library (L1B227). 

DNA sequencing of the clones isolated as described above gave the fulMength DNA sequence for NL7 
(designated herein as DNA59776-1600 [Figure 227, SEQ ID N0:3 13]) and the derived proiein sequence for NL7 
35 (PR01346). 

The entire coding sequence of NL7 (PR01346) is shown in Figure 227 (SEQ ID N0:313). Clone 
DNA59776-1600 contains a single open reading frame with an apparent translational initiation site at nucleotide 
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positions 1-3 and an apparent stop codon at nucleotide positions 1384-1386. The predicted polypeptide precursor 
is 46 1 amino acids long. The protein contains an apparent type 11 transmembrane domain at amino acid positions 
from about 31 to about 50; fibrinogen beta and gamma chains C-terminal domain signature starting at about 
amino acid petition 409« and a leucine zipper paaem starting at about ammo acid positions 140, 147, 154 and 
161, respectively. Clone DNA59776-1600 has been deposited with ATCC and is assigned ATCC deposit no. 
5 203128. The full-length NL7 protein shown in Figure 228 has an estimated molecular weight of about 50,744 
daltons and a pi of about 6.38. 

Based on a WU-BLAST2 sequence alignment analysis (using the WU-BLAST2 computer program) of 
the full-length sequence, NL7 shows significant amino acid sequence identity to a htmian microfibril-associated 
glycoprotein (1 MFA4_HUMAN); to known TIE-2 ligands and ligand homoiogues, ficolin, serum lectin and 
10 TGF-1 binding protein. 

EXAMPLE 101: Isolation of cDNA clones Encoding Human PR01131 

A cDNA sequence isolated in the amylase screen described in Example 2 above is herein designated 
DNA43546 (see Figure 231; SEQ ID NO:320). The DNA43546 sequence was then compared to a variety of 

IS expressed sequence tag (EST) databases which included public EST databaises (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ™, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into consensus DNA sequences with 

20 the program "phrap'' (Phil Green, University of Washington, Seattle. Washington). The consensus sequence 
obtained therefrom is herein designated DNA45627. 

Based on the DNA45627 sequence, oligonucleotide probes were generated and used to screen a human 
library prepared as described in paragraph 1 of Example 2 above. The cloning vector was pRK5B (pRKSB is 
a precursor of pRK5D that does not contain the Sfil site; see. Holmes et al.. Science 253:1278-1280 (1991)), 

25 and the cDNA size cut was less than 2800 bp. 

PGR primers (forward and 2 reverse) were synthesized: 
forward PGR primer S'-ATGCAGGCCAAGTACAGCAGCAC-3' (SEQ ID NO:321); 
reverse PGR primer 1 5*-GATGCTGAGGAGTTCCTGCAAGG-3* (SEQ ID NO:322); and 
reverse PGR primer 1 5'^GAGAGAGTCrrGTGC 11 C 11 GGG-3' (SEQ ID NO:323) 

30 Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA45627 

sequence which had the following nucleotide sequence: 
hybridization probe 

5 -ATGGTGGATGATGATGGGGAGAGGAGGATGAGGCTGGATT.3* (SEQ ID NO:324). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 
35 screened by PGR amplification with die PGR primer pair identified above. A positive library was then used to 
isolate clones encoding the PROl 131 gene using the probe oligonucleotide and one of the PGR primers. 
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A full length clone was identified that contained a single open reading frame with an apparent 
translaiional initiation site at nucleotide positions 144-146, and a stop signal at nucleotide positions 984-986 
(Figure 229; SEQ ID NO:318). The predicted polypeptide precursor is 280 amino acids long, has a calculated 
molecular weight of approximately 31,966 daltons and an estimated pi of approximately 6.26. The 
transmembrane domain sequence is at about 49-74 of SEQ ID NO:319 and the region having sequence identity 
5 with LDL receptors is about 50-265 of SEQ ID NO:319- PROI 131 contains potenual N-linked glycosylation 
sites at amino acid positions 95-98 and 169-172 of SEQ ID NO:319. Clone DNA59777-1480 has been deposited 
with the ATCC and is assigned ATCC deposit no. 203 1 1 1 . 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown inpigure 230 (SEQ ID N0:3 19). evidenced some sequence 
10 identity between the PROI 131 amino acid sequence and the following Dayhoff sequences: AB0107i0_! , 149053. 
I49I15, RNU56863_I, LY4A_M0USE, 155686. MMU56404_1, 149361, AF030313_1 and MMU09739_l. 

EXAMPLE 102: Isolation of cDNA clones Encoding Human PRO 1281 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
15 in Example 1 above. This consensus sequence is designated herein as DNA35720. Based on the DNA35720 

sequence, oligonucleotides were synthesized: 1) lo identify by PCR a cDNA library that contained the sequence 

of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR01281. 
PCR primers (forward and reverse) were synthesized: 

forward PCR primers : 
20 5*-TGGAAGGCTGCCGCAACGACAATC-3' (SEQ ID NO:327); 

5'-CTGATGTGGCCGATGTTCTG-3* (SEQ ID NO:328); and 

5 -ATGGCTCAGTGTGCAGACAG-3* (SEQ ID NO:329). 

reverse PCR primers: 

5 -GCATGCTGCTCCGTGAA(3TAGTCC-3* (SEQ ID NO:330); and 
25 5 -ATGCATGGGAAAGAAGGCCTGCCC-3* (SEQ ID NO:33 1). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA35720 sequence 
which had the following nucleotide sequence: 
hvbridization probe: 

5*-TGCACTGGTGACCACGAGGGGGTGCACTATAGCCATCTGGAGCTGAG-3' (SEQ ID NO:332). 
30 In order to screen several libraries for a source of a fuH-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pairs idemified above. A positive library was then used 

to isolate clones encoding the PR01281 gene using the probe oligonucleotide and one of die PCR primers. RNA 

for construction of the cDNA libraries was isolated human fetal liver. 

DNA sequencing of the clones isolated as described above gave the full-lengdi DNA sequence for 
35 PR01281 (designated herein as DNA59820-1549 (Figure 232, SEQ ID N0:3251; and die derived protein 

sequence for PR01281. 



459 



awcrv-v*ir> ^\«r» 



wo 99/63088 



PCT/US99/12252 



TheentirecodingsequenceofPR01281 is shown in Figure 232 (SEQ ID NO:325). Clone DNA59820. 
1549 coniains a single open reading frame with an apparent iranslational initiation site at nucleotide positions 
228-230 and an apparent stop codon at nucleotide positions 2553-2555. The predicted polypeptide precursor 
is 775 amino acids long. The full-length PR01281 protein shown in Figure 233 has an esumaied molecular 
weight of about 85,481 daltons and a pi of about 6.92. Additional features include a signal peptide at about 

5 amino acids 1-15; and potenual N-glycosylaiion sites at about amino acids 138-141 ami 361-364. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 233 (SEQ ID NO:326), revealed some sequence 
identity between the PR01281 amino acid sequence and ibe following Dayhoff sequences: S44860, CET24D1_1 . 
CEC38H2_3, CAC2,HAEC0. B3A2^HUMAN. S22373. CEF38A3J. CEC34F6_2, CEC34F6J, and 

10 CELT22B11_3. 

Clone DNA59820-1549 has been deposited with ATCC and is assigned ATCC deposit no. 203129. 

EXAMPLE 103: Isolation of cDNA clones Encoding Human PRO 1064 

A cDN A sequence isolated in the amylase screen described in Example 2 above was found, by the WU- 

15 BLAST2 sequence aiignment computer program, to have no significant siequence identity to any known human 
protein. This cDNA sequence is herein designated DNA45288. The DN A45288 sequence was then compared 
to various EST databases including public EST databases (e.g., GenBank), and a proprietary EST database 
(LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify homologous EST sequences. The comparison 
was performed using the computer program BLAST or BLAST2 |Altschul et al.. Methods in EnTymolo^y, 

20 266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in some cases, 90) or greater 
that did not encode known proteins were clustered and assembled into a consensus DNA sequence with the 
program "phrap" (Phil Green, University of Washington, Seattle, Washington). This consensus sequence is 
herein designated DNA48609. Oligonucleotide primers based upon the DNA48609 sequence were then 
synthesized and employed to screen a human fetal kidney cDN A library which resulted in the identification of 

25 the DNA59827-1426 clone shown in Figure 234. The cloning vector was pRK5B (pRK5B is a precursor of 
pRK5D that does not contam the Sfil site; see. Holmes et al.. Science , 253:1278-1280 (1991)), and the cDNA 
size cut was less than 2800 bp. 

The oligonucleotide probes employed were as follows: 
forward PGR primer 5'-CTGAGACCCTGCAGCACCATCTG-3' (SEQ ID NO:336) 
30 reverse PGR primer 5*-GGTGCTTCTTGAGCCCCACTTAGC-3' (SEQ ID NO:337) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
DN A48609 sequence which had the following nucleotide sequence 

^ hvbridization probe 

5'.AATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCG<rrGT.3' (SEQ ID NO:338) 
35 A full length clone was identified that contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 532-534 and a stop signal at nucleotide positions 991-993 
(Figure 234, SEQ ID NO:333). The predicted polypeptide precursor is 153 amino acids long, has a calculated 
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molecular weight of approximately 17,3 17 daltons and an estimated pi of approximately 5.17. Analysis of the 
full-length PRO1064 sequence shown in Figure 235 (SEQ ID NO:334) evidences the presence of the following: 
a signal peptide from about amino acid 1 to about amino acid 24, a transmembrane domain from about amino 
acid 89 to about amino acid 1 10» an indole-3-glycerol phosphate synthase homology block from about amino acid 
74 to about amino acid 105 and a Myb DNA binding domain protein repeal protein homology block from about 
5 amino acid 1 14 to about amino acid 137. Clone DNA59827- 1426 has b^n deposited with ATCC on August 4, 
1998 and is assigned ATCC deposit no. 203089. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35). using a WU-BLAST2 sequence 
alignment analysis of the M-lengih sequence shown in Figure 235 (SEQ ID NO:334), evidenced homology 
between the PRO1064 amino acid sequence and the following Dayhoff sequences: MMNP15PRO_l, 
10 BP187PLYH^1,CELF42G8J,MMU58888^1,GEN14270,TUB8_SOLTU,RCN_MOUSE.HUMRBSY7 

SESENODA_l and A21467_l. 

EXAMPLE 104: Isolation of cDNA clones Encoding Human PRO 1379 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
15 in Example 1 above. This consensus sequence is designated herein DNA45232. Based on the DNA45232 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PRO 1379, 

PCR primers (forward and reverse) were synthesized: 
20 forward PCR primer 5'>TGGACACCGTACCCTGGTATCTGCO' (SEQ ID NO:34 1) 
reverse PCR primer 5'-CCAACTCTGAGGAGAGCAAGTGGC-3' (SEQ ID NO:342) 

Additionally, a synthetic oligonucleotide hybridizaiion probe was constructed from the consensus 
DNA45232 sequence which had the following nucleotide sequence: 
hvbridizauon probe 

25 5*-TGTATGTGCACACCCTCACCATCACCTCCAAGG(K:aAGGAGAAC-3' (SEQ ID NO:343). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 

isolate clones encoding the PR01379 gene using the probe oligonucleotide and one of the PCR primers. RNA 

for construction of the cDNA libraries was isolated human fetal kidney tissue. 
30 DNA sequencing of the clones isolated as described above gave the fiill-length DNA sequence for 

PR01379 which is designated herein as DNA59828-1608 and shown in Figure 237 (SEQ ID NO:339); and the 

derived protein sequence for PROI379 (SEQ ID NO:340). 

The entire coding sequence of PR01379 is shown in Figure 237 (SEQ ID NO:339). Clone DNA59828- 

1608 contains a single open reading frame with an apparent translational initiation site at nucleotide positions 
35 10-12 and an apparent stop codon at nucleotide positions 1732-1734. The predicted polypeptide precursor is 574 

amino acids long. The full-lengdi PR01379 protein shown in Figure 238 has an estimated molecular weight of 

about 65,355 daltons and a pi of about 8.73. Additional feamres include a signal peptide at about amino acids 
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1-17 and poiential N-glycosyladon sites at about amino acids 160-163, 287-290, and 323-326. 

An analysis of the Dayboff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of ihe full-length sequence shown in Figure 238 (SEQ ID NO:340), revealed some homology 
between the PR01379 amino acidsequenceand the following Dayhoff sequences: YHY8_YEAST, AF040625_1, 
HP714394^1. and HIV18U45630,1. 
5 Clone DNA59828-I608 has been deposited with ATCC and is assigned ATCC deposit no. 203158. 

EXAMPLE 105: Isolation of cDNA Clones Encoding Human PRQ844 

An wq)ressed sequence tag (EST) DNA database (UFESEQ™, Incyte Pharmaceuticals, Palo Alto. CA) 
was searched and an EST was identifled which showed sequence identity with aLP. Based on the information 
10 and discoveries provided herein, die clone for this EST, Incyte clone no. 2657496 from a cancerous lung library 
was further examined. 

DNA sequencing of the insert for this clone gave a sequence (herein designated as DNA59838-1462; 
SEQ ID NO:344) which includes the full-length DNA sequence for PR0844 and the derived protein sequence 
for PR0844. 

15 The entire nucleotide sequence of DNA59838-1462 is shown in Figure 239 (SEQ ID NO:344). Clone 

DNA59838-1462 contains a single open reading frame with an apparent translaiional initiation site at nucleotide 
positions 5-7 and ending at the stop codon at nucleotide positions 338-340 of SEQ ID NO:344 (Figure 239). 
The predicted polypeptide precursor is 111 amino acids long (Figure 240). The full-length PR0844 protein 
shown in Figure 240 has an estimated molecular weight of about 12,050 dalions and a pi of about5.45. Clone 

20 UNQ544 DNA59838-1462 has been deposited with ATCC on June 16, 1998. It is understood that the deposited 
clone has the actual nucleic acid sequence and that the sequences provided herein are based on known 
sequencing techniques. 

Analysis of the amino acid sequence of the full-length PR0844 polypeptide suggests that it possesses 
significant sequence similarity to serine protease inhibitors, thereby indicating that PR0844 may be a novel 
25 proteinase inhibitor. More specifically, an analysis of die Dayhoff database (version 35.45 SwissProt 35) 
evidenced significant homology between the PR0844 amino acid sequence and at least the following Dayhoff 
sequences, ALK1_.HUMAN, P_P82403. P_P82402, ELAF^HUMAN and P_P60950. 

EXAMPLE 106: Isolation of cDNA Clones Encoding Human PR0848 

30 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

EST cluster sequence from the Incyte database. This EST cluster sequence was dien compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (UFESEQ*. Incyte Phannaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using die computer pn^m BLAST or BLAST2 (Altshul et al., Methods in 

35 Enzymology 266:460-480 ( 1 996)) . Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not eiuxxie known proteins were clustered and assembled into a consensus DNA sequence with 
the program *'phrap" (Phil Green, University of Washington. Seattle, Washington). The consensus sequence 

462 



c»jcrw-v~trv- ^wo 



wo 99/63088 n^^.^^^^ 

PCT/US99/12252 

obtained therefrom is herein designated DNA55999. 

In light of an observed sequence homology between the DNA55999 consensus sequence and an EST 
sequence encompassed within the Incyie EST clone no. 2768571, the Incyie EST clone 2768571 was purchased 
and the cDNA insen was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insen is shown in Figure 241 and is herein designated as DNA59839-1461. 
5 The entire nucleotide sequence of DNA59839-1461 is shown in Figure 241 (SEQ ID NO:346). Clone 

DNA59839-1461 contains a single open reading frame with an apparent uranslauonal initiation site at nucleotide 
positions 146-148 and ending at the stop codon at nucleotide positions 1946-1948 of SEQ ID NO:346 (Figure 
241). The predicted polypeptide precursor is 600 amino acids long (Figure 242). The fiiU-lengih PR0848 
protein shown in Figure 242 has an estimated molecular weight of about 68,536 daltons. Clone DNA59839-1461 

10 has been deposited with ATCC on June 16, 1998. It is understood that the deposited clone has the actual nucleic 
acid sequence and that the sequences provided herein are based on known sequencing techniques. 

Analysis of the amino acid sequence of the fiill-length PR0848 polypeptide suggests that it may be a 
novel sialyhransferase. More specifically, an analysis of the Dayhoff database (version 35.45 SwissProt 35) 
evidenced sequence identity between the PR0848 amino acid sequence and at least the following Dayhoff 

15 sequences, P^R78619 (GalNAc-alpha-2, 6-sialyltransferase). CAAG5^CHICK (alpha-n-aceiylgalactosamide 
alpha-2,6-sialytransferase),HSU14550_l,CAG6_HUMANand P^R632I7 (humanalpha-2, 3.sialyltransferase). 

EXAMPLE 107: Isolation of cDNA Clones Encoding Human PRO 1097 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

20 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto. CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al. , Methods in 
Em^ynology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

25 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence widi 
the program **phrap- (PhU Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56006. 

In light of an observed sequence homology between the DNA56006 consensus sequence and an EST 
sequence encompassed within the Incyie EST clone no. 2408105, the Incyte EST clone 2408105 was purchased 

30 and the cDNA insen was obtained and sequenced. It was found that this insen encoded a full-length protein. 
The sequence of this cDNA insen is shown in Figure 243 and is herein designated as DNA5984M460. 

The entire nucleotide sequence of DNA5984 1-1460 is shown in Figure 243 (SEQ ID NO:348). Clone 
DNA5984 1-1460 contains a single open reading frame with an apparent translaiional initiation site at nucleotide 
positions 3-5 and ending at the stop codon at nucleotide positions 276-278 of SEQ ID NO:348 (Figure 243). 

35 The predicted polypeptide precursor is 91 amino acids long (Figure 244). The full-length PRO1097 protein 
shown in Figure 244 has an estimated molecular weight of about 10.542 daltons and a pi of about 10.04. Clone 
DNA59841.1460hasbeendepositedwiihATCConJuly 1, 1998. It is understood that the deposited clone has 

463 



wsoocrf> <wo 



WO99/d3088 



PCT/US99/12252 



the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

Analyzing Figure 244, the signal peptide is at about amino acids 1-20 of SEQ ID NO:349. The 
glycoproiease family protein domain staru at about amino acid 56, and the acyliransferase ChoActase/COT/CPT 
family peptide starts at about amino acid 49 of SEQ ID NO:349. 

5 

EXAMPLE 108: Isolation of cDNA clones Encoding Human PRO! 153 

Use of the signal sequience algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was rfien compared to a variety of 
e}q>ressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
10 EST DNA database (LIFESEQ^, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
Enzymology 266:46(M8Q (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle. Washington). The consensus sequence 
15 obtained therefrom is herein designated DNA56008. 

In light of an observed sequence homology between the DNA56008 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2472409, the Incyte EST clone 2472409 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insen encoded a full-length protein. 
The sequence of this cDNA insen is shown in Figure 245 and is herein designated as DNA59842-1502. 

20 The full length clone shown in Figure 245 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 92-94 and ending at the stop codon found at nucleotide positions 
683-685 (Figure 245; SEQ ID NO:350). The predicted polypeptide precursor (Figure 246, SEQ ID NO:351) 
is 197 amino acids long. PROl 153 has a calculated molecular weight of approximately 2 1 .540 daltons and an 
estimated pl of approximately 8.31 . Clone DNA59842-1502 has been deposited with ATCC and is assigned 

25 ATCC deposrt no. 209982. It is understood that the correct and acnial sequence is in the deposited clone while 
herein are present representations based on current sequencing techniques which may have minor errors. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
length sequence, PROl 153 shows some ammo acid sequence identity to the following Dayhoff designations: 
S57447; SOYHRGPC_l: S46965; P_P82971; VCPHEROPH^l; EXTN_TOBAC; MLCB2548_9; 

30 ANXA_RABIT; JC5437 and SSGP^VOLCA. 

EXAMPLE 109! Isolation of cDNA clones Encoding Human PROl 154 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
35 expressed sequence tag (EST) databases which included public EST databases (e.g. . GenBank) and a proprietary 
EST DNA database (UFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
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EnzYmology 266:460480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green. University of Washington, SeatUe, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56025. 

In light of an observed sequence homology between the DNA56025 consensus sequence and an EST 
5 sequence encompassed within the Incyie EST clone no. 2169375, the Incyte EST clone 2 169375 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 247 and is herein designated as DNA59846-1503. 

The full length clone shown in Figure 247 coniamed a single open reading frame with an apparent 
translational iniuaiion site at nucleotide positions 86-88 and ending at the stop codon found at nucleotide positions 
10 2909-291 i (Figure 247; SEQ ID NO:352). The predicted polypeptide precursor (Figure 248, SEQ ID NO:353) 
is 941 amino acids long. PROI 154 has a calculated molecular weight of approximately 107, 144 daltons and an 
estimated pl of approximately 6.26. Clone DNA59846-1503 has been deposited with ATCC and is assigned 
ATCC deposit no. 209978. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
15 length sequence, PROl 154 shows sequence identity to at least the following Dayhoff designations: ABOl 1097 1 . 
AMPN_HUM AN, RNU76997_1 . 15933 1 . GEN 14047, HSU62768^1 , P_R5 128 1 , CET07F10 1 , SSU66371_1 , 
and AMPRE_HUMAN. 

EXAMPLE 110! Isolation of cDNA clones Encoding Human PROllSI 
20 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

EST cluster sequence from the Incyte database, designated herein as 82468, This EST cluster sequence was then 
compared to a variety of expressed sequence tag (EST) databases which included public EST databases (e.g., 
GenBank) and a proprietary EST DNA database (LIFESEQ*. Incyte Pharmaceuticals, Palo Alto, C A) to identily 
existing homologies. The homology search was performed using the computer program BLAST or BLAST2 
(Altshul et al.. Methods in EnTymolo^ 266:460-480 (1996)). Those comparisons resulting in a BLAST score 
of 70 (or in some cases 90) or greater diat did not encode known proteins were clustered and assembled into a 
consensus DNA sequence with the program -phrap" (Phil Green, University of Washington, Seattle, 
Washington). The consensus sequence obtained therefrom is herein designated DNA56029. 

In light of an observed sequence homology between the DN A56029 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2186536. the Incyie EST clone 2186536 was purchased 
and the cDNA insen was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 249 and is herein designated as DNA59847-151 1 . 

Clone DNA59847-1511 contains a single open reading frame with an ^parent translational initiation 
site at nucleotide positions 17-19 and ending at the stop codon at nucleotide positions 1328-1330 (Figure 249). 
The predicted polypeptide precursor is 437 amino acids long (Figure 250). The full-length PROl 181 protein 
shown m Figure 250 has an estimated molecular weight of about 46.363 daltons and a pi of about 6.22. Analysis 
of the fiill-iength PROl 181 sequence shown in Figure 250 (SEQ ID NO:355) evidences die presence of the 
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following: a signal peptide from about amino acid 1 to about amino acid 15. potential N-glycosylation sites from 
about amino acid 46 to about amino acid 49. from about amino acid 189 to about amino acid 192 and from about 
amino acid 382 to about amino acid 385 and amino acid sequence blocks having homology to Ly-6/u-PAR 
domain proteins from about amino acid 287 to about amino acid 300 and from about amino acid 98 to about 
amino acid 111. Clone DNA59847-151 1 has been deposited with ATCC on August 4. 1998 and is assigned 
S ATCC deposu no. 70X98. 

An analysis of the Dayhpff database (version 35.45 SwissPiot 35). using a WU-BLAST2 sequence 
alignment analysis of the fhU-length sequence shown in Figure 250 (SEQ ID NO:355). evidenced homology 
between the PR01181 amino acid sequence and the following Dayhoff sequences: AF041083 1, P_W26579. 
RNMAGPIAN.l. CELT13C2_2. LMSAP2GN_1. S61882. CEF35C5_12. DP87_DICDI. GIU47631 1 and 
10 P_R07092. 

EXAMPLE \ 1 1.: Isolation of c DNA clones Encoding Human PROl 182 

Use of the signal sequence algorithm described in Example 3 above allowed idemification of a single 
EST cluster sequence from the Incyte database, designated herein as 146647. This EST cluster sequence was 
15 then compared to a variety of expressed sequence tag (EST) databases which mcluded public EST databases 
(e.g.. GenBank) and a proprietary EST DNA database (LIFESEQ* Incyte Pharmaceuticals. Palo Alto. CA) to 
identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul et al.. Methods in Enzymolory 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater diat did not encode known proteins were clustered and 
20 assembled imo a consensus DNA sequence with the program "phrap" (Phil Green. University of Washington. 
Seattle. Washington). The consensus sequence obtained therefrom is herein designated DNA56033. 

In light of an observed sequence homology between the DNA56033 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2595195. the Incyte EST clone 2595195 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insen encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 251 and is herein designated as DNA59848-I512. 

Clone DNA59848-1512 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 67-69 and ending at the stop codon at nucleotide positions 880-882 (Figure 251). Tbe 
predicted polypeptide precursor is 271 amino acids long (Figure 252). The full-length PROl 182 protein shown 
in Figure 252 has an estimated molecular weight of about 28.665 daltons and a pi of about 5.33. Analysis of 
30 the ftiU-length PROl 182 sequence shown in Figure 252 (SEQ ID NO:357) evidences the presence of the 
foUowing: a signal peptide from about amino acid I to about amino acid 25. an amino acid block having 
homology to C-type lectin domain proteins from about amino acid 247 to about amino acid 256 and an amino 
acid sequence block having homology to Clq domain proteins from about amino acid 44 to about amino acid 
77. Clone DNA59848-1512 has been deposited with ATCC on August 4. 1998 and is assigned ATCC deposit 
35 no. 203088. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35). using a WU-BLAST2 sequence 
alignmem analysis of the full-length sequence shown in Figure 252 (SEQ ID NO:357). evidenced significant 
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homology between the PROl 182 amino acid sequence and the following Dayhoff sequences: PSPD^BOVIN, 
CU3^B0VIN. CONG^BOVIN, P_^W18780. P_R45005, P^R53257 and CELEGAPT^l. 

EXAMPLE 112 : Isolation of cDNA clones Encoding Human PROl 155 

Use of the signal sequence algorithm described in Example 3 above allowed ideniificaiion of a single 
5 EST cluster sequence from the Incyie database. This EST cluster sequence was then compared to a variety of 

expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 

EST DNA database (LIFESEQ*, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 

Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
10 or greater that did not encode known proteins were clustered and a^embled into a consensus DNA sequence with 

the program "phrap" (Phil Green, University of Washington. Seattle, Washington). The consensus sequence 

obtained therefrom is herein designated DNA56102. 

In light of an observed sequence homology between the DNA56102 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 2858870, the Incyte EST clone 2858870 was purchased 
15 and the cDNA insen was obtained and sequenced. It was found that this insert encoded a full-length protein. 

The sequence of this cDNA insert is shown in Figure 253 and is herein designated as DNA59849-1504. 

The full length clone shown in Figure 253 contained a single open reading frame with an apparent 

iranslational initiation site at nucleotide positions 158-160 and ending at the stop codon found at nucleotide 

positions 563-565 (Figure 253; SEQ ID NO:358). The predicted polypeptide precursor (Figure 254. SEQ ID 
20 NO:359) is 135 amino acids long. PROl 155 has a calculated molecular weight of approximately 14,833 dalions 

and an estimated pi of approximately 9.78. Clone ON A59849- 1 504 has been deposited with ATCC and is 

assigned ATCC deposit no. 209986. It is understood that the actual clone has the correct sequence whereas 

herein are only representations which are prone to minor sequencing errors. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
25 length sequence, PROl 155 shows some amino acid sequence identity with the following Dayhoff designations: 

TKNK_BOVIN; PVB19X587_1; AF019049_1; P_W00948; S72864; P^W00949; 162742; AF038501_1; 

TKNG^HUMAN; and YATl^RHOBL. Based on the mformation provided herein, PROl 155 may play a role 

in providing neuroprotection and cognitive enhancement. 

30 EXAMPLE 113: Isolation of cDNA clones Encoding Human PROl 156 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database, designated herein as 138851. This EST cluster sequence was 
then compared to a variety of expressed sequence tag (EST) databases which included public EST databases 
(e.g., CjenBank) and a proprietary EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto. CA) to 

35 identify existing homologies. The homology search was performed using the computer program BLAST or 
BLAST2 (Altshul ei al.. Methods in Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a 
BLAST score of 70 (or in some cases 90) or greater that did not encode known proteins were clustered and 
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assembled into a consensus DNA sequence with the program '^phrap*' (Phil Green, University of Washington. 
Seattle, Washington). The consensus sequence obtained therefrom is herein designated DNA56261. 

In light of an observed sequence homology between the DNA56261 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3675191. the Incyie EST clone 3675191 was purchased 
and the cDNA insen was obtained and sequenced. It was foimd that this insert encoded a full-length protein. 
5 The sequence of this cDNA insen is shown in Figure 255 and is herein designated as DNA59853*1505. 

The lull length clone shown in Figure 255 contained a single open reading frame with an apparent 
transladonal initiation site at nucleotide positions 212-214 and ending at the stop codon found at nucleotide 
positions 689-691 (Figure 255; SEQ ID NO:360). The predicted polypeptide precursor (Figure 256. SEQ ID 
NO:36I) is 159 amino acids long.* PROI 156 has a calculated molecular weight of approximately 17.476 daltons. 
10 an estimated pl of approximately 9.15, a signal peptide sequence at about amino acids I to about 22, and 
potential N-glycosylation sites at about amino acids 27-30 and 41-44. 

Clone DNA59853-1505 was deposited mth the ATCC on June 16, 1998 and is assigned ATCC deposit 
no. 209985. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
IS alignment analysis (using the ALIGN computer program) of the full-length sequence shown in Figure 256 (SEQ 
ID NO:361). revealed some homology between the PROI 156 amino acid sequence and the following Dayhoff 
sequences: D45027_l. P_R79914. JC5309, KBF2^HUMAN, AF010144_^1. GEN14351, S68681. P_R79915. 
ZMTACJ. and HUMCPGO^l . 

20 EXAMPLE 114: Isolation of cDNA Clones Encoding Human PRO1098 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ^^. Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

25 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled imo a consensus DNA sequence with 
the program "phrap" (Phil Green. University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56377. 

30 In light of an observed sequence homology between the DNA56377 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3050917. the Incyte EST clone 3050917 was purchased 
and the cDNA insen was obtained and sequenced. It was foimd that this insen encoded a full-length protein. 
The sequence of this cDNA insen is shown in Figure 257 and is herein designated as DNA59854-1459. 

The entire nucleotide sequence of DNA59854-1459 is shown in Figure 257 (SEQ ID NO:362). Clone 

35 DNA59854-I459 contains a single open reading frame with an apparent translational initiation site at nucleotide 
positions 58-60 and ending at the stop codon at nucleotide poshions 292-294 of SEQ ID NO:362 (Figure 257). 
The predicted polypeptide precursor is 78 amino acids long (Figure 258). The full-length PRO 1098 protein 
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shown in Figure 258 has an estimated moItt:ular weight of about 8.396 daiions and a pi of about 7.66. Clone 
DNA59854-1459 has been deposited with ATCC on June 16. 1998. It is understood that the deposited clone 
has the actual nucleic acid sequence and that the sequences provided herein are based on known sequencing 
techniques. 

Analyzing Figure 258, a signal peptide appears to be at about amino acids M9 of SEQ ID NO:363. 
5 an N-glycosylaiion site appears to be at about amino acids 37-40 of SEQ ID NO:363. and N-myristoylation sites 
appear to be at about 15-20, 19-24 and 60-65 of SEQ ID NO:363. 

EXAMPLE 115 : Isolation of cDNA clones Encoding Human PROl 127 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

10 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. . GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshui et al., Methods in 
Enzvmoloev 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

15 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57959. 

In light of an observed sequence homology between the ON A57959 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. 685126, the Merck EST clone 685126 was purchased 

20 and the cDNA insert was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insen is shown in Figure 259 and is herein designated as DNA60283-1484. 

The fiill length clone shown in Figure 259 contained a single open reading frame with an apparent 
translational initiation site at nucleotide petitions 126-128 and ending at the stop codon found at nucleotide 
positions 327-329 (Figure 259; SEQ ID NO:364). The predicted polypeptide precursor (Figure 260. SEQ ID 

25 NO:365) is 67 amino acids long including a signal peptide at about 1-29 of SEQ ID NO:365. PROl 127 has a 
calculated molecular weight of approximately 7,528 daitons and an estimated pi of approximately 4.95. Clone 
DNA60283-1484 was deposited with the ATCC on July 1, 1998 and is assigned ATCC deposit no. 203043. 
It is understood that the deposited clone has the actual sequence, whereas representations which may have minor 
sequencing errors are presented herein. 

30 An analysis of the Dayhoff database (version 35.45 SwissProt 35). using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 260 (SEQ ID NO:365), revealed some homology 
between the PROl 127 amino acid sequence and the following Dayhoff sequences: AF037218_48, P_W09638. 
HBA_HETPO, S39821, KR2_EBV, CET20D3_8, HCU37630^1. HS193B12_10, S40012 and TRITUBC_1. 

35 
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EXAMPLE 116: Isolation of cDNA clones Encoding Human PRQl 126 

Use of the signal sequence algoiithm described in Example 3 abovt allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ*, Incyte Pharmaceuticals. Palo Alto, CA) to identify existing homologies. The 
5 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled mto a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seanle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56250. 

10 In light of an observed sequence homology between the DNA56250 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 1437250, the Incyte EST clone 1437250 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insert encoded a (ull-length protein. 
The sequence of this cDNA insen is shown in Figure 261 and is herein designated as DNA606I5-1483. 

Clone DNA60615-1483 contains a single open reading frame with an apparent translational initiation 

15 site at nucleotide positions 110-112 and ending at the stop codon at nucleotide positions 13I6-I318 (Figure 261). 
The predicted polypeptide precursor is 402 amino acids long (Figure 262). The ftilMength PROl 126 protein 
shown in Figure 262 has an estimated molecular weight of about 45,92 1 daltons and a pi of about 8.60. Analysis 
of the full-length PROl 126 sequence shown in Figure 262 (SEQ ID NO:367) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 25 and potential N-glycosylation sites 

20 from about amino acid 66 to about amino acid 69, from about amino acid 1 38 to about amino acid 141 and from 
about amino acid 183 to about amino acid 186. Clone DNA60615-1483 has been deposited with ATCC on June 
16, 1998 and is assigned ATCC deposit no. 209980. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35). using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 262 (SEQ ID NO:367). evidenced significant 

25 homology between the PROl 126 amino acid sequence and the following Dayhoff sequences: 173636, 
NOMR^HUMAN, MMUSMY0C3^1, HS454G6.1, P^R98225, RNU78105_1, RNU72487J, AF03530l_I, 
CEELC48E7j4 and CEFl 1C3 J. 

EXAMPLE 117: Isolation of cDNA clones Encoding Human PROl 125 

30 Use of the signal sequence algoridun described in Example 3 above allowed identification of a single 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (UFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 

35 Enzvmology 266:460^480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program ''phrap'* (Phil Green. University of Washington. Seattle. Vvashington). The consensus sequence 
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obtained therefrom is herein designated DNA56540. 

In light of an observed sequence homology between the DNA56540 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 14861 14, the Incyie EST clone 14861 14 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insen encoded a full-length protein. 
The sequence of diis cDNA msen is shown in Figure 263 and is herein designated as DNA60615-1483. 
5 The full length clone shown in Figure 263 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 47-49 and ending at the stop codon found at nucleotide positions 
1388-1390 (Figure 263; SEQ ID NO:368). The predicted polypeptide precursor (Figure 264, SEQ ID NO:369) 
is 447 amino acids long. PROl 125 has a calculated molecular weight of approximately 49.798 daltons and an 
estimated pl of approximately 9.78. Clone DNA60619-1482 has been deposited witii ATCC and is assigned 

10 ATCC deposit no. 209993. It is understood tiiat the clone has die actual sequence and tiiat the sequences herein 
are representations based on current techniques which may be prone to minor errors. 

Based on a WU-BLAST2 sequence alignment analysis (using the ALIGN computer program) of the full- 
length sequence, PROl 125 shows some sequence identity witii the following Dayhoff designations: 
RC01_NEUCR; S58306; PKWA^THECU; S76086; P,R85881: HETl.PODAN; SPU92792,1; 

15 APAF^HUMAN; S76414 and S59317. 

EXAMPLE 118 : Isolation of cDNA clones Encoding Human PROl IRft 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 

20 expressed sequence tag (EST) databases which mcluded public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto. CA) to identify existing homologies. The 
homology search was performed using die computer program BLAST or BLAST2 (Altshul et al.. Methods in 
EnzymoloRY 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater diat did not encode known proteins were clustered and assembled into a consensus DNA sequence with 

25 die program "phrap" (Phil Green, University of Washington, Scattie, Washington). The consensus sequence 
obtained dierefrom is herein designated DNA56748. 

In light of an observed sequence homology between die DNA56748 consensus sequence and an EST 
sequence encompassed within die Incyte EST clone no. 3476792, die Incyte EST clone 3476792 was purchased 
and die cDNA insert was obtained and sequenced. It was found diat diis insen encoded a full-lengdi protein. 

30 The sequence of this cDNA insert is shown in Figure 265 and is herein designated as DNA60621-1516. 

The full length clone shown in Figure 265 contained a single open reading frame widi an apparent 
translational initiation site at nucleotide positions 91-93 and ending at die stop codon found at nucleotide positions 
406-408 (Figure 265; SEQ ID NO:370). The predicted polypeptide precursor (Figure 266. SEQ ID NO:371) 
is 105 amino acids long. The signal peptide is at amino acids M9 of SEQ ID NO:371. PROl 186 has a 

35 calculated molecular weight of approximately 11,715 daltons and an estimated pl of approximately 9.05. Clone 
DNA60621-1516 was deposited widi die ATCC on August 4. 1998 and is assigned ATCC deposit no. 203091. 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35). using a WU-BLAST2 sequence 
aligiunent analysis of the full-length sequence shown in Figure 266 (SEQ ID NO:371), revealed some sequence 
identity between the PR01186 amino acid sequence and the following Dayhoff sequences: VPRA DENPO, 
LFE4_CHICK. AF034208_1. AF030433_1. A55035. COL_RABIT. CELB0507_9. S67826_l. S34665 and 
CRU73817_1. 

EXAMPLE 1 19: Isolation of cDNA clones Encoding Human PRQl IQ8 

An initial DNA sequence referred to herein as DNA52083 was identified using a yeast screen in a 
human umbUical vein endothelial cell cDNA Ubrary that preferentially represents the 5' ends of the primary 
cDNA clones. DNA52083 was compared to ESTs from public databases (e.g.. GenBank). and a proprietary 
EST database (LIFESEQ*. Incyte Pharmaceuticals, Palo Alto, CA), using the computer piogiam BLAST or 
BLAST2 lAltschul et al.. Methods in Enzymolofy 266:460-480 (1996)]. The ESTs were clustered and 
assembled into a consensus DNA sequence using the computer program "irtirap" (Phil Green. University of 
Washington, SeatUe. Washington). One or more of the ESTs was obuined from human breast skin tissue 
biopsy. This consensus sequence is designated herein as DNA52780. 
15 In light of an observed sequence homology between the DNA52780 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3852910. the Incyte EST clone 3852910 was purchased 
and the cDNA insen was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 267 and is herein designated as DNA60622-1525. 

The fiill length DNA60622-1525 clone shown in Figure 267 (SEQ ID NO:372) contained a single open 
20 reading frame with an apparent translational initiation site at nucleotide positions 54 to 56 and ending at die stop 
codon found at nucleotide positions 741 to 743. The predicted polypeptide precursor, which is shown in Figure 
268 (SEQ ID NO:373). is 229 amino acids long. PRQl 198 has a calculated molecular weight of approximately 
25,764 daltons and an estimated pi of approximately 9. 17. There is a signal peptide sequence at about amino 
acids 1 through 34. There is sequence identity with glycosyl hydrolases family 3 1 protein at about amino acids 
25 142 to about 175. 

An analysb of the Daj*off database (version 35.45 SwissProt 35). using a WU-BLAST2 sequence 
alignment analysis of the fiill-length sequence shown in Figure 268 (SEQ ID NO:373), revealed some homology 
between die PROI 198 amino acid sequence and the following Dayhoff sequences: ATF6H11 6, UCRI RAT. 
TOBSUP2NT_l, RCUERF3_I. AMU88186_1. P_W22485. S56579. AF040711_1. DPP4_PIG. 
30 Clone DNA60622-1525 was been deposited with the ATCC on August 4. 1998, and is assigned ATCC 

deposit no. 203090. 

EXAMPLE 120: Isolation of cDNA clones Encoding Human PRQl 1S« 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
35 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. . GenBank) and a proprietary 
EST DNA database (LIFESEQ*, Incyte Pharmaceuticals. Palo Alto. CA) to identify existing homologies. The 
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homology search was pcrformwl using the computer program BLAST or BLAST2 (Altshul et al., Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DN A sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57248. 
5 In light of an observed sequence homology between the DN A57248 consensus sequence and an EST 

sequence encon^jassed within the Incyte EST clone no. 2640776. the Incyte EST clone 2640776 was purchased 
and the cDNA insen was obtained and sequenced. It was found that this insert encoded a fiilMength protein. 
The sequence of this cDNA insert is shown in Figure 269 and is herein designated as DNA60625-1507. 

The ftill length clone shown in Figure 269 contained a single open reading frame with an apparent 
10 translational initiation site at nucleotide positions 163 to 165 and ending at the stop codon found at nucleotide 
positions 532 to 534 (Figure 269; SEQ ID NO:374). The predicted polypeptide precursor (Figure 270, SEQ 
ID NO:375) is 123 amino acids long. PR01158 has a calculated molecular weight of approximately 13,1 13 
daltons and an estimated pi of approximately 8.53. Additional features include a signal peptide sequence at about 
amino acids 1-19. a transmembrane domain at about amino acids 56-80, and a potential N-glycosylation site at 
15 about amino acids 36-39. Clone DNA60625-1507 was deposited with the ATCC on June 16, 1998 and is 
assigned ATCC deposit no. 209975. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the ftill-length sequence shown in Figure 270 (SEQ ID NO:375), revealed some homology 
between the PROl 158 amino acid sequence and the following Dayhoff sequences: ATAC00310510F18A8.10. 
20 P_R8515l, PHS2^S0LTU, RNMHCIBAC^l, RNAlFMHC^l. 168771, RNRT1A10G_1. PTPA^HUMAN, 
HUMGACA_1. and CHKPTPA^l. 

EXAMPLE 121 : Isolation of cDNA clones Encoding Human PROl 159 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

25 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) daubases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals. Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et ah. Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

30 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle. Washington). The consensus sequence 
obtained therefrom is herein designated DNA57221 . 

In light of an observed sequence homology between the DNA57221 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 376776, the Incyte EST clone 376776 was purchased 

35 and the cDNA insen was obtained and sequenced. It was found that this insen encoded a full-length protein. 
The sequence of this cDNA insen is shown in Figure 271 and is herein designated as DNA60627-1508. 
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Clone DNA60627-1508 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 92-94 and ending at the stop codon at nucleotide positions 362-364 (Figure 271). The 
predicted polypeptide precursor is 90 amino acids long (Figure 272). The full-length PRO 1 159 protein shown 
in Figure 272 has an esdmated molecular weight of about 9,840 daltons and a pi of about 10. 13. Analysis of 
ihe fiUl-iength PR011S9 sequence shown in Figure 272 (SEQ ID NO:377) evidences the presence of the 
5 following: a signal peptide from about amino acid 1 to about amino acid IS and a potential N-glycosylation site 
from about amino acid 38 to about amino acid 41 . Clone DNA60627-1508 has been deposited with ATCC on 
August 4, 1998 and is assigned ATCC deposit no. 203092. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 272 (SEQ ID NO:377), evidenced significant 
10 homology between the PR01159 amino acid sequence and the following Dayhoff sequences: AF016494 6, 
AF036708J0, DSSCUTE^l, D89100^l, S28060, MEFA^XENLA, AF020798_12,G70065,E64423,JQ2005. 

EXAMPLE 122: Isolation of cDNA clones Encoding Human PRO 1124 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

15 EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietaiy 
EST DNA database (LIFESEQ®, Incyie Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

20 or greater thai did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program **phrap'* (Phil Green, University of Washington, Seattle. Washington). The consensus sequence 
obtained therefrom is herein designated DNA56035. 

In light of an observed sequence homology between the DNA56035 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 2767646, the Incyte EST clone 2767646 was purchased 

25 and the cDNA insen was obuined and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insert is shown in Figure 273 and is herein designated as DNA60629-1481 . 

The full length clone shown in Figure 273 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 25-27 and ending at the stop codon found at nucleotide positions 
2782-2784 (Figure 273; SEQ ID NO:378). The predicted polypeptide precursor (Figure 274, SEQ ID NO:379) 

30 is 919 amino acids long. PROl 124 has a calculated molecular weight of approximately 101 ,282 daltons and an 
estimated pi of approximately 5.37. Clone DNA60629-1481 has been deposited with the ATCC and is assigned 
ATCC deposit no. 209979. It is imderstood that the deposited clone has the actual sequence, whereas only 
representations based on current sequencing techniques which may include normal and minor errors, are 
provided herein. 

35 Based on a WU-BLAST2 sequence alignment analysis of the full-length sequence, PROl 124 shows 

significant amino acid sequence identity to a chloride channel protein and to ECAM-1. Specifically, the 
following Dayhoff designations were identified as having sequence identity with PROl 124: ECLC^BOVIN, 
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AF001261_1. P_W06548, SSC6A10_1, AF004355_1. S76691. AF017642, BYU06866^2, CSA^DICDI and 
SAU47139J. 

EXAMPLE 123 : Isolation of cDNA clones Encoding Human PRO 1287 

An expressed sequence lag (EST) DNA database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) 
5 was searched and an EST was identified which showed homology to the fringe protein. This EST sequence was 
then compared to various EST databases including public EST databases (e.g. , GenBank), and a proprietary EST 
database (LIFESEQ*. incyte Pharmaceuticals. Palo Alto, CA) to identify homologous EST sequences. The 
comparison was performed using the computer program BLAST or BLAST2 [Altschul et al.. Methods in 
Enzvmology. 266:460-480 (1996)]. Those comparisons resulting in a BLAST score of 70 (or in some cases, 

10 90) or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence 
with the program 'phrap" (Phil Green, University of Washingtoii, Seattle, Washington). This consensus 
sequence obtained is herein designated DNA40568. 

Based on the DNA40568 consensus sequence, oligonucleotides were synthesized: 1) to identify by PGR 
a cDNA library that contained the sequence of interest, and 2) for use as probes to isolate a clone of the ftill- 

15 length coding sequence for PR01287. Forward and reverse PGR primers generally range from 20 to 30 
nucleotides and are often designed to give a PGR product of about ICQ- 1000 bp in length. The probe sequences 
arc typically 40-55 bp in length. In some cases, additional oligonucleotides are synthesized when the consensus 
sequence is greater than about 1-1 .5kbp. In order to screen several libraries for a fiill-lengih clone, DNA from 
the libraries was screened by PGR amplification, as per Ausubel ei al.. Current Pro tocols in Molecular Biology. 

20 supra, with the PGR primer pair. A positive library was then used to isolate clones encoding the gene of interest 
using the probe oligonucleotide and one of the primer pairs. 

PGR primers (forward and reverse) were synthesized: 
forward PGR primer 5 -GTGGGGGAAAGGGACTTGATGTTGG-3' (SEQ ID NO:382) 
reverse PGR primer 1 5*^GGAAGGTGAGGGTGTATGTGGTGGG-3* (SEQ ID NO:383) 

25 reverse PGR primer 2 5'-CACiGGTAGAGGTATTGAGG-3* (SEQ ID NO:384) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40568 
sequence which had the following nucleotide sequence 
hybridization probe 

5*-GAGTGAGTAGAATGGTGGGATAATATAGGGGGAGGATGATGGAGTGGG-3* (SEQ ID NO:385). 

30 In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PGR amplification with the PGR primer pairs identified above. A positive library was then used 
to isolate clones encoding the PR01287 gene using the probe oligonucleotide and one of the PGR primers. 

RNA for construction of the cDNA libraries was isolated from human bone marrow tissue. The cDN A 
libraries used to isolated the cDNA clones were constructed by standard methods using commercially available 

35 reagents such as those from Inviirogen, San Diego, GA. The cDNA was primed with oligo dT containing a Noil 
site, linked with blunt to Sail hemikinased adaptors, cleaved with Notl, sized appropriately by gel 
electrophoresis, and cloned in a defined orientation into a suitable cloning vector (such as pRKB or pRKD; 
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pRK5B is a precursor of pRK5D thai does not contain the Sfil site; see. Holmes et aJ., Science. 253:1278-1280 
(1991)) in the unique Xhol and Notl sites. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 
PR01287 (designated herein as DNA61755-1554 [Figure 275. SEQ ID NO:380]) and the derived protein 
sequence for PR01287. 

5 The entire nucleotide sequence of DNA61755-1554 is shown in Figure 275 (SEQ ID NO:380). The 

full length clone contained a single open reading frame with an apparent translational initiation site at nucleotide 
positions 655-657 and a stop signal at nucleotide positions 2251-2253 (Figure 275. SEQ ID NO:380). The 
predicted polypeptide precursor is 532 amino acids long, has a calculated molecular weight of approximately 
61 ,35 1 daltons and an estimated pi of approximately 8.77. Analysis of the full-length PR01287 sequence shown 

10 in Figure 276 (SEQ ID NO:381) evidences the presence of die following: a signal peptide from about amino acid 
1 to about amino acid 27 and potential N-glycosylation sites ftx>m about amino acid 315 to about amino acid 318 
and from about amino acid 324 to about amino acid 327. Clone DNA61755-1554 has been deposited with 
ATCC on August 1 1 , 1998 and is assigned ATCC deposit no. 2031 12. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

15 alignment analysis of the full-length sequence shown in Figure 276 (SEQ' ID NO:381), evidenced significant 
homology between the PRO 1287 amino acid sequence and the following Dayhoff sequences: CET24D1 1, 
EZRI^BOVIN, GGU19889_1, CC3^YEAST, S74244, NALS_MOUSE, M0ES_P1G. S28660. S44860 and 
YNA4_CAEEL. 

20 EXAMPLE 124 : Isolation of cDNA clones Encoding Human PR01312 

DNA55773 was identified in a human fetal kidney cDNA library using a yeast screen that preferentially 
represents the 5' ends of the primary cDNA clones. Based on the DNA55773 sequence, oligonucleotides were 
syndiesized for use as probes to isolate a clone of the full-length coding sequence for PR01312. 

The fiill length DNA61873-1574 clone shown in Figure 277 (SEQ ID NO:386) contained a single open 

25 reading frame with an apparent translational initiation site at nucleotide positions 7-9 and ending at the stop 
codon found at nucleotide positions 643-645. The predicted polypeptide precursor is 212 amino acids long 
(Figure 278, SEQ ID NO:387). PR01312 has a calculated molecular weight of approximately 24,024 daltons 
and an estimated pl of approximately 6.26. Other feamres include a signal peptide at about amino acids 1-14; 
a transmembrane domain at about amino acids 141-160, and potential N-glycosylation sites at about amino acids 

30 76-79 and 93-96. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the lull-length sequence shown in Figure 278 (SEQ ID NO:387), revealed some homology 
between the PR01312 amino acid sequence and the following Dayhoff sequences: GCINTALPH^l. 
GIBMUC1A_1, P^R96298, AF001406^1, PVU88874^i, P_R8515l, AF041409^1, CELC50F2^7, C45875, 
35 and AB009510_21. 

Clone DNA61873-1574 has been deposited widi ATCC and is assigned ATCC deposit no. 203132. 
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EXAMPLE 125 : Isolation of cDNA clones Encoding Human PROl 192 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above. TTiis consensus sequence is designated herein DNA35924, Based on the DNA35924 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PGR a cDNA library that conuined 
the sequence of interest, and 2) for use as probes to isolate a clone of the fiilMength coding sequence for 
5 PROl 192. 

PGR primers (forward and reverse) were syndiesized: 
forward PGR primer : 5'<:GGAGGCGATGTAGAGGCGAGAGG-3* (SEQ ID NO:390) 
reverse PGR primer : 5'-ACAGGGAGAGGCAATGGGGAGAGG-3' (SEQ ID NO:391). 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus 
10 DNA3S924 sequence which had the following nucleotide sequence: 
hybridization probe: 

5*-GAGAGGA(n'GGGGGAGTTTGGGACCTTTGTGGAGAGGTGGTGATG.3' (SEQ ID NO:392). 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PGR amplification with the PGR primer pair identified above. A positive library was then used to 
15 isolate clones encoding the PROl 192 gene using the probe oligonucleotide and one of the PGR primers. RNA 

for construction of the cDNA libraries was isolated from human fetal liver and spleen tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PROl 192 designated herein as DNA62814-1521 and shown in Figure 279 (SEQ ID NO:388); and the derived 

protein sequence for PROl 192 which is shown in Figure 280 (SEQ ID NO:389). 
20 The entire coding sequence of PROl 192 is shown in Figure 279 (SEQ ID NO:388). Gione DNA62814- 

1521 contains a single open reading frame with an apparent iranslaiional initiation site at nucleotide positions 

121-123 and an apparent stop codon at nucleotide positions 766-768. The predicted polypeptide precursor is 2 15 

amino acids long. The predicted polypeptide precursor has the following features: a signal peptide at about 

amino acids 1-21; a transmembrane domain at about amino acids 153-176; potential N-giycosylation sites at 
25 about amino acids 39-42 and 118-121; and homology with myelin PO proteins at about amino acids 27-68 and 

99-128 of Figure 280. The fiilMength PROl 192 protein shown in Figure 280 has an estimated molecular weight 

of about 24,484 daltons and a pl of about 6.98. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-B1-AST2 sequence 

alignment analysis of the full-length sequence shown in Figure 280 (SEQ ID NO;389), revealed homology 
30 between the PROl 192 amino acid sequence and the following Dayhoff sequences: GEN12838, MYPO_HUMAN, 

AF049498^i, GEN14531, P_W14146, HS46KDA,1. GINB^RAT, 0X2G_RAT, D87018_l, and D86996J. 
Glone DNA62814-1521 was deposited with the ATGG on August 4, 1998. and is assigned ATCG 

deposit no. 203093. 

35 
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EXAMPLE 126: Isolation of cDNA clones Encoding Human PRO 11 60 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 
in Example 1 above This consensus sequence is herein designated DNA40650. Based on the DNA40650 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the iiilMength coding sequence for 
5 PRO1160. 

PCR primers (forward and reverse) were synthesized: 
forward PCR primer 5*-GCTCCCTGATCTTCATGTCACCACC-3* (SEQ ID NO:395) 
reverse PCR primer S'-CAGGGACACACTCTACCATTCGGGAn^-^' (SEQ ID NO:396) 
Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA40650 
10 sequence which had the following nucleotide sequence 
hybridization probe 

5*-CCATCTTTCTGGTCTCTGCCCAGAATCCGACAAGAGCTG(rrC.3' (SEQ ID NO:397) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 
15 isolate cloites encoding die PROl 160 gene using the probe oligonucleotide" and one of the PCR primers. RNA 

for construction of the cDNA libraries was isolated from human breast tissue. 

DNA sequencing of die clones isolated as described above gave the full-length DNA sequence for 

PROl 160 (designated herein as DNA62872-1509 [Figure 281, SEQ ID NO: 393J) and die derived protein 

sequence for PROl 160. 

20 The entire nucleotide sequence of DNA62872-1509 is shown in Figure 281 (SEQ ID NO:393). Clone 

DNA62872-1509 contains a single open reading frame with an apparent transiational initiation site at nucleotide 
positions 40-42 and ending at the stop codon at nucleotide positions 310-312 (Figure 281). The predicted 
polypeptide precursor is 90 amino acids long (Figure 282). The fiill-lengdi PROl 160 protein shown in Figure 
282 has an estimated molecular weight of about 9,039 daltons and a pl of about 4.37. Analysis of the full-length 

25 PROl 160 sequence shown in Figure 282 (SEQ ID NO:394) evidences die presence of the following: a signal 
peptide from about amino acid I to about amino acid 19 and a protein kinase C phosphorylation site from about 
amino acid 68 to about amino acid 70. Clone DNA62872-1509 has been deposited widi ATCC on August 4. 
1998 and is assigned ATCC deposit no. 203100. 

An analysis of die Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

30 alignment analysis of die fulMengdi sequence shown in Figure 282 (SEQ ID NO:394), evidenced significant 
homology between the PROl 160 amino acid sequence and die following Dayhoff sequences: B30305, 
GEN13490, 153641, S53363. HA34_BRELC, SP96^DICDI, S36326, SSU51197^10. MUCl.XENLA, 
TCU32448^l and AF000409^1. 

35 EXAMPLE 127 : Isolation of cDNA clones Encoding Human PRO! 187 

Use of die signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from die Incyte database. This EST cluster sequence was then compared to a variety of 
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expressed sequence tag (EST) databases which included public EST databases (e.g. . CienBank) and a proprietary 
EST DNA database (LIFESEQ®, Incyte Pharmaceuticals. Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Alishul ei al.. Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
5 the program "phrap" (Phil Green, University of Washington. Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57726. 

In light of an observed sequence homology between the DNAS7726 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 358563, the Incyte EST clone 358563 was purchased 
and the cDNA insen was obtained and sequenced. It was found that this insert encoded a full-length protein. 

10 The sequence of this cDNA insen is shown in Figure 283 and is herein designated as DNA62876-1517. 

The full length clone shown in Figure 283 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 121-123 and ending at the stop codon found at nucleotide 
positions 481-483 (Figure 283; SEQ ID NO:398). The predicted polypeptide precursor (Figure 284, SEQ ID 
NO:399) is 120 amino acids long. The signal peptide is at about amino acids M7 of SEQ ID NO:399. 

15 PR01187 has a calculated molecular weight of approximately 12,925 daltons and an estimated pi of 
approximately 9.46. Clone DNA62876-1517 was deposited with the ATCC on August 4, 1998 and is assigned 
ATCC deposit no. 203095. It is understood that the deposited clone contains the actual sequence and that the 
representations herein may have minor sequencing errors. 

An analysis of the Dayhoff database (version 35.45 SwissProi 35), using a WU-BLAST2 sequence 

20 alignment analysis of the full-length sequence shown in Figure 284 (SEQ ID NO:399), revealed some sequence 
identity (and therefore some relation) between the PRO 1 187 amino acid sequence and the following Dayhoff 
sequences: MGNENDOBX^l, CELF41G3^9, AMPG_STRLI, HSBBOVHERLJ. LEEXTENlO^l. 
AF029958^1 and P_W04957. 

25 EXAMPLE 128: Isolation of cDNA clones Encoding Human PROl 185 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto» CA) to identify existing homologies. The 

30 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
Enzvmologv 266:460480 (1996)). Those comparisons resulting m a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA56426. 

35 In light of an observed sequence homology between the DNA56426 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 328441 1, the Incyte EST clone 328441 1 was purchased 
and the cDNA insen was obtained and sequenced. It was foimd that this insert encoded a full-length protein. 
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Hie sequence of this cDNA insert is shown in Figure 285 and is herein designated as DNA62881-1515. 

TTie full length DNA6288M515 clone shown in Figure 285 contained a single open reading frame with 
an apparent translaiional initiation site at nucleotide positions 4-6 and ending at the stop codon found at 
nucleotide positions 598-600 (Figure 285; SEQ ID NO:400). The predicted polypepude precursor (Figure 286, 
SEQ ID NO:401) is 198 amino acids long. The signal peptide is at about amino acids 1-21 of SEQ ID N0:401 . 
5 PR01185 has a calculated molecular weight of approxinuiely 22,105 daltons and an estimated pi of 
approximately 7.73. Clone DNA62881-1515 has been deposited with the ATCC and is assigned ATCC deposit 
no. 203096. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 286 (SEQ ID NO:401), revealed some sequence 
10 identity between die PR01185 amino acid sequence and the following Dayhoff sequences: TUPl YEAST, 
AR)41382_1, MAOM^SOLTU, SPPBPHU9_1 ,141024. EPCPLCFAIL_1, HSPLEC^l, YKU^CAEEL, 
A44643,TGU65922^1. 

EXAMPLE 129: Isolation of cD NA clones Encoding Human PROms 
15 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

in Example 1 above. This consensus sequence is herein designated DNA47364. Based on the DNA47364 
consensus sequence, oligonucleotides were synthesized: 1) to identify by PGR a cDNA library that contained 
the sequence of interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for 
PR01345. 

20 PGR primers (forward and reverse) were synthesized: 

forward PGR primer 5'-CCTGGTTATrrrrAnfiAArTrrr.Ar-^' (SEQ ID NO:404) 

reverse PGR primer 5'-CTrTTnrTr,fn'nrnAr Aonrrrr-i' (SEQ ID NO:405) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the consensus DNA47364 

sequence which had the following nucleotide sequence 

25 hvbridization probe 

5*-CGCCCTCCAAGACTATGGTAAAAGGAGCCTGCCAGGTGTCAATGAC-3' (SEQ ID NO:406) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PGR amplification with the PGR primer pair idemified above. A positive library was then used to 

isolate clones encoding the PR01345 gene using the probe oligonucleotide and one of the PGR primers. RNA 
30 for construction of the cDNA libraries was isolated from him:ian breast carcinoma tissue. 

DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PR01345 (designated herein as DNA64852-1589 [Figure 287, SEQ ID NO:402]) and the derived protein 

sequence for PR01345. 

The entire nucleotide sequence of DNA64852.1589 is shown in Figure 287 (SEQ ID NO:402). Glone 
35 DNA64852-1589 contains a single open reading frame with an apparent u-anslational initiation site at nucleotide 
positions 7-9 or 34-36 and ending at the stop codon at nucleotide positions 625-627 (Figure 287). The predicted 
polypeptide precursor is 206 amino acids long (Figure 288). The full-length PR01345 protein shown in Figure 



480 



wo 99/63088 



PCTAJS99/122S2 



288 has an esiimaied moi«nilar weight of about 23,190 dalions and a pi of about 9.40. Analysis of the full- 
length PRO 1345 sequence shown in Figure 288 (SEQ ID NO:403) evidences the presence of the following: a 
signal peptide from about amino acid 1 to about amino acid 31 or from about amino acid 10 to about amino acid 
31 and a C-type lectin domain signamre sequence from about amino acid 176 to about amino acid 190. Clone 
DNA64852-1589 has been deposited with ATCC on August 18. 1998 and is assigned ATCC deposit no. 203127. 
5 An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

alignment analysis of the full-length sequence shown in Figure 288 (SEQ ID NO:403), evidenced significant 
homology between the PR01345 amino acid sequence and the following Dayhoff sequences: BTU22298_1, 
TETN^CARSP, TETN^HUMAN. MABA^RAT, S34198, P_W13144, MACMBPA_1, A46274, PSPD^RAT 
AND P_R32188. 

10 

EXAMPLE 130: Isolation of cDNA clones Encoding Human PRO 1245 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 

15 EST DNA database (LIFESEQ®. Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et ah. Methods in 
Enzvmology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle. Washington). The consensus sequence 

20 obtained therefrom is herein designated DNA56019. 

In light of an observed sequence homology between the DNA56019 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 1327836, the Incyte EST clone 1327836 was purchased 
and the cDNA insen was obtained and sequenced. It was found that this insert encoded a full-length protein. 
The sequence of this cDNA insen is shown in Figure 289 and is herein designated as DNA64884-1527. 

25 The fiill length clone shown in Figure 289 contained a single open reading frame with an apparent 

translational initiation site at nucleotide positions 79-81 and ending at the stop codon found at nucleotide positions 
391-393 (Figure 289; SEQ ID NO:407). The predicted polypeptide precursor (Figure 290. SEQ ID NO:408) 
is 104 amino acids long, with a signal peptide sequence at about aihino acid 1 to about amino acid 18. 
PR01245 has a calculated molecular weight of approximately 10,100 daltons and an estimated pi of 

30 approximately 8.76. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 290 (SEQ ID NO:408), revealed some homology 
between the PR01245 amino acid sequence and the following Dayhoff sequences: SYA^THETH, GENl 1 167, 
MTV044_4, ABOl 1151^1, RLAJ2750J, SNELIPTRA_l. S63624, C28391, A37907, and S14064. 

35 Clone DNA64884-1245 was deposited with the ATCC on August 25, 1998 and is assigned ATCC 

deposit no. 203155. 
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EXAMPLE 131: Isolation of c DNA clones Fncodinp Hnm ^ PR01358 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. TTiis EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases(e.g..GenBa^^ 
EST DNA database (UFESEQ* Incyte Phannaceuticals. Palo Alto. CA) to idenufy existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
l!aano!sgii266:46(M80 (1996)). Those comparisons resulting in aBLAST score of 70 (or in some cases 90) 
or greater that did not encode taiownproteinswereclusteredatul assembled intoaconsena^ 
the program "phrap" (PhU Green. University of Washington. Seaule. Washington). 

In light of an observed sequence homology between the consensus sequence and an ECT sequence 
encompassed within the Incyte EST clone no. 88718. Ae Incyte EST clone 887 18 was purchased and the cDNA 
insert was obtained and sequenced. It was found Aat this insen encoded a fiiU-length protein. The sequence 
of this cDNA insen is shown in Figure 291 and is herein designated as DNA64890-1612. 

The Ml length clone shown in Figure 291 contained a single open reading frame with an apparent 
translational initiation site at nucleotide positions 86 through 88 and ending at the stop codon found at nucleotide 
positions 1418 through 1420 (Figure 291 ; SEQ ID NO:409). The predicted polypeptide precursor (Figure 292. 
SEQ ID N0:410) is 444 amino acids long. The signal peptide is at about amino acids 1-18 of SEQ ID NO:410. 
PR01358 has a calculated molecular weight of approximately 50.719 daltons and an estimated pi of 
approximately 8.82. Clone DNA64890-1612 was deposited with the ATCC on August 18. 1998 and is assigned 
ATCC deposit no. 203131. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35). using a WU-BLAST2 sequence 

alignmemanalysisofthe full-length sequence shown inFigure 292 (SEQ ID NO:410). revealed sequence identic 
between the PR01358 amino acid sequence and the following Dayhoff sequences: P_W07607, AB000545 l 

AB000546J.AlAT_RAT.AB015164_l.P_P50021.COTR_CAVPO.andHAMHPPJ.TT,evariantsclaimed 
in this application exclude these sequences. 



EXAMPLg Isolation nf ^pNA clones Fn coding Hiimfln PRQi 195 

Use of the signal sequence algorithm described m Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 

expressed sequence tag (EST^ databases which included public EST da,abases(e.g..GenBank)andaprop^^^ 
EST DNA database aiFESEQ* Incyte Pharmacewicals. Palo Alto. CA) to identiiy existing homologies. TTte 
homology search was performed using the computer piogram BLAST or BLAST2 (Altshul et al.. Methods in 
Enaanolan 266:460-180 ( 1 996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap- (PhD Green. University of Washington. Seattle. Washington). TTte consensus sequence 
35 obtained therefrom is herein designated DNA55716. 

In light of an observed sequence homology between the DNA55716 consensus sequence and an EST 
sequence encompassed within the Incyte EST done no. 3252980. the Incyte EST clone 3252980 was purchased 
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and the cDNA insert was obtained and sequenced. It was found that this insen encoded a luIMength protein. 
The sequence of this cDNA insen Is shown in Figure 293 and is herein designated as DNA65412-1523. 

The full length clone shown in Figure 293 contained a single open reading frame with an apparent 
transiatlonal initiation site at nucleotide positions 58-60 and ending at the stop codon found at nucleotide positions 
511-513 (Figure 293; SEQ ID N0:411). The predicted polypeptide precursor (Figure 294, SEQ ID NO:412) 
5 is 151 amino acids long. The signal sequence is at about amino acids 1-22 of SEQ ID NO:412. PROl 195 has 
a calculated molecular weight of approximately 11,211 daltons and an estimated pi of approximately 5.33. Clone 
DNA65412-1523 was deposited with the ATCC on August 4. 1998 and is assigned ATCC deposit no. 203094. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 294 (SEQ ID NO:412). revealed some sequence 
10 identity between the PROl 195 amino acid sequence and the following Dayhoff sequences: MMU28486 1, 
AF044205_1, P^W31186. CELK03C7_1, F69034, EF1A_METVA, AF024540^1, SSU90353_1. 
MRSP_STAAU and P__R97680. 

EXAMPLE 133: Isolation of cDNA clones Encoding Human PRO1270 

15 Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST D^^ A database (LIFESEQ®, Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 
homology search was performed using die computer program BLAST or BLAST2 (Altshul ei al.. Methods in 

20 Enzvmology 266:460^80 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program **phrap" (Phil Green, University of Washington. Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57951. 

In light of an observed sequence homology between the DNA57951 consensus sequence and an EST 

25 sequence encompassed within the Merck EST clone no. 124878, the Merck EST clone 124878 was purchased 
and the cDNA insen was obtained and sequenced. It was found that this insen encoded a full-length protein. 
The sequence of this cDNA insen is shown in Figure 295 and is herein designated as DNA66308-1537. 

Clone DNA66308-1537 contains a single open reading frame with an apparent translational initiation 
site at nucleotide positions 103-105 and ending at the stop codon at nucleotide positions 1042-1044 (Figure 295). 

30 The predicted polypeptide precursor is 313 amino acids long (Figure 296). The full-length PRO1270 protein 
shown in Figure 296 has an estimated molecular weight of about 34,978 daltons and a pl of about 5.71 . Analysis 
of the full-length PRO1270 sequence shown in Figure 296 (SEQ ID NO:414) evidences the presence of the 
following: a signal peptide from about amino acid I to about amino acid 16, a potential N-glycosylation site from 
about amino acid 163 to about amino acid 166 and glycosaminoglycan attachment sites from about amino acid 

35 74 to about amino acid 77 and from about amino acid 289 to about amino acid 292. Clone DNA66308- 1537 has 
been deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no. 203159. 
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An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 

aligmnem analysis of the full-length sequence shown in Figure 296 (SEQ ID NO:414), evidenced significant 

homology between the PRO1270 amino acid sequence and the following Dayhoff sequences: XLU86699 1, 

S49589, nBA_PARPA.FffiB^HUMAN.P,R47189. AF(XM326_1.DRTENASCN_1. AF()(M327^ P_W0141 1 
and FIBG^BOVIN. 

5 

EXAMPLE 134: Isolation of cDNA clones Encoding Human PRO 127 1 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. . GenBank) and a proprietary 
10 EST DNA database (LIFESE<J». Incyte Pharmaceuticals, Palo Alto. CA) to identify existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
Enzymology 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode iaiown proteins were clustered and assembled into a consensus DNA sequence with 
the program -phrap-* (Phil Green. University of Washington, SeatUe, Washington). The consensus sequence 

15 obtained therefrom is herein designated DNA57955. 

In light of an observed sequence homology between the DNA57955 consensus sequence and an EST 
sequence encompassed within the Merck EST clone no. AA625350. the Merck EST clone AA625350 was 
purchased and the cDNA insen was obtained and sequenced. It was found that this insen encoded a full-length 
protein. The sequence of this cDNA insen is shown in Figure 297 and is herein designated as DNA66309-i538. 

20 Clone DNA66309-1538 contains a single open reading frame with an apparent translational initiation 

site at nucleotide positions 94-96 and ending at the stop codon at nucleotide positions 7 1 8-720 (Figure 297). The 
predicted polypeptide precursor is 208 amino acids long (Figure 298). The full-length PR01271 protein shown 
in Figure 298 has an estimated molecular weight of about 21 ,53 1 daltons and a pi of about 8.99. Analysis of 
the fiiU-length PR01271 sequence shown in Figure 298 (SEQ ID NO:416) evidences the presence of the 

25 following: a signal peptide from about amino acid 1 to about amino acid 3 1 and a transmembrane domain from 
about amino acid 166 to about amino acid 187. Clone DNA66309-1538 has been deposited with ATCC on 
September 15, 1998 and is assigned ATCC deposit no. 203235. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
aligmnem analysis of the ftill-lcngth sequence shown in Figure 298 (SEQ ID NO:416), evidenced significant 

30 homology between the PROI271 amino acid sequence and the foUowmg Dayhoff sequences: S57180. S63257, 
AGA1_YEAST. BPU43599,1. YS8A.CAEEL, S67570, LSU54556j, S70305, VGLX_HSVEB, and 
D88733_l. 

EXAMPLE 135 : Isolation of cDNA clones Encoding Human PRnM7S 
35 A Merck/Wash. U. database was searched and a Merck EST was identified. This sequence was then 

put in a program which aligns it with other seequences from the Swiss-Prot public database, public EST 
databases (e.g., GenBank, Merck/Wash. U.), and a proprietaiy EST database (LIFESEQ*. Incyte 

484 



wo 99/63088 



PCTAJS99/I2252 



Pharmaceuticals, Palo Alto. CA). The search was performed usmg the computer program BLAST or BLAST2 
[Altschul et al., Methods in Enzymolo p iY 266:460-480 (1996)) as a comparison of the extracellular domain 
(ECD) protein sequences to a 6 frame translation of the EST sequences. Those comparisons resulting in a 
BLAST score of 70 (or in some cases, 90) or greater that did not encode known proteins were chisiered and 
assembled into consensus DNA sequences with the program **phrap" (Phil Green, University of Washington, 
5 Seattle, Washington). 

A consensus DNA sequence was assembled relative to other EST sequences using phrap. This 
consensus sequence is designated herein ''DNA67003". 

Based on theDNA67003 consensus sequence, the nucleic acid (SEQ ID NO:4I7) was identified in a 
human pancreas library. DNA sequencing of the clone gave the fiilMength DNA sequence for PRO 1375 and 
10 the derived protein sequence for PR0137S. 

The entire coding sequence of PR01375 is shown in Figure 299 (SEQ ID NO:417). Clone DNA67004- 
1614 contains a single open reading frame with an apparent translational initiation site at nucleotide posuions 
104-106 and an apparent stop codon at nucleotide positions 698-700 of SEQ ID NO:417. The predicted 
polypeptide precursor is 198 amino acids long. The transmembrane domains are at about amino acids 1 1-28 
15 (type II) and 103-125 of SEQ ID NO:418. Clone DNA67004-1614 has been deposited with ATCC and is 
assigned ATCC deposit no. 203 115. The full-length PRO 1375 protein shown in Figure 300 has an estimated 
molecular weight of about 22,531 daltons and a pl of about 8.47. 

An analysis of the Dayhoff database (version 35.45 SwissPror 35). using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 300 (SEQ ID N0:4 1 8), revealed sequence identity 
20 between the PR01375 amino acid sequence and the following Dayhoff sequences: AF026 198 5, CELR12C12_5, 
S73465, Y011_MYCPN, S64538_l. P_P8150, MUVSHPO10_l, VSH_MUMPL and CVU59751_5, 

EXAMPLE 136 : Isolation of cDNA clones Encoding Human PR01385 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 

25 EST cluster sequence from the Incyte database. This EST cluster sequehce was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g. , GenBank) and a proprietary 
EST DNA database (LIFESEQ®. Incyte Pharmaceuticals, Palo Alto, CA) to ideniily existing homologies. The 
homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
Enzvmolo^ 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 

30 or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program ''phrap'* (Phil Green, University of Washington, Seattle, Washington). The consensus sequence 
obtained therefrom is herein designated DNA57952. 

In light of an observed sequence homology between the DNA57952 consensus sequence and an EST 
sequence encompassed within the Incyte EST clone no. 3129630, the Incyte EST clone 3129630 was purchased 

35 and the cDNA insert was obtained and sequenced. It was found that this insen encoded a full-length protein. 
The sequence of this cDNA insen is shown in Figure 301 and is herein designated as DNA68869-1610. 
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Clone DNA68869-]6i0 contains a single open reading frame with an apparent iranslaiional initiation 
site at nucleotide positions 26-28 and ending at the stop codon at nucleotide positions 4 1(M 12 (Figure 301). The 
predicted polypeptide precursor is 128 amino acids long (Figure 302). The full-length PRO 1 385 protein shown 
in Figure 302 has an estimated molecular weight of about 13,663 daltons and a pi of about 10.97. Analysis of 
the full-length PR01385 sequence shown in Figure 302 (SEQ ID NO:420) evidences the presence of the 
5 following: a signal peptide from about amino acid 1 to about amino acid 28. and glycosylaminoglycan attachment 
sites from about amino acid 82 to about amino acid 85 and from about amino acid 91 to about amino acid 94. 
Clone DNA68869-1610 has been deposited with ATCC on August 25. 1998 and is assigned ATCC deposit no. 
203164. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
10 alignment analysis of the full-length sequence shown in Figure 302 (SEQ ID NO:420), evidenced low homology 
between the PR01385 amino acid sequence and the following Dayhoff sequences: CELT14A8_1, 
LMNACHRAl.l . HXD9_HUM AN. CHKCMLF^l , HS5PP34_2. DMDRING^l , A37107_l , 
MMLUNGENE.l, PUM^DROME and DMU251I7_1. 

15 EXAMPLE 137: Isolation of cDNA clones Enc oding Human PR01387 

Use of the signal sequence algorithm described in Example 3 above allowed identification of a single 
EST cluster sequence from the Incyte database. This EST cluster sequence was then compared to a variety of 
expressed sequence tag (EST) databases which included public EST databases (e.g., GenBank) and a proprietary 
EST DNA database (LIFESEQ* Incyte Pharmaceuticals, Palo Alto, CA) to identify existing homologies. The 

20 homology search was performed using the computer program BLAST or BLAST2 (Altshul et al.. Methods in 
Enzvmologv 266:460-480 (1996)). Those comparisons resulting in a BLAST score of 70 (or in some cases 90) 
or greater that did not encode known proteins were clustered and assembled into a consensus DNA sequence with 
the program "phrap" (Phil Green, University of Washington, Seattle. Washington). The consensus sequence 
obtained therefrom is herein designated DNA56259. 

25 In light of an observed sequence homology between the DN A56259 consensus sequence and an EST 

sequence encompassed within the Incyte EST clone no. 3507924, the Incyte EST clone 3507924 was purchased 
and the cDNA insert was obtained and sequenced. It was found that this insen encoded a fiill-lengih protein. 
The sequence of this cDNA insert is shown in Figure 303 and is herein designated as DNA68872-1620. 

Clone DNA68872-1620 contains a single open reading frame with an apparent translational initiation 

30 site at nucleotide positions 85-87 and ending at the stop codon at nucleotide positions 1267-1269 (Figure 303). 
The predicted polypeptide precursor is 394 amino acids long (Figure 304). The full-length PR01387 protein 
shown in Figure 304 has an estimated molecular weight of about 44,339 daltons and a pl of about 7. 10. Analysis 
of the full-length PRO 1387 sequence shown in Figure 304 (SEQ ID NO:422) evidences the presence of the 
following: a signal peptide from about amino acid 1 to about amino acid 19, a transmembrane domain from about 

35 amino acid 275 to about amino acid 296, potential N-glycosylation sites from about amino acid 76 to about amino 
acid 79, from about amino acid 231 to about amino acid 234, from about amino acid 302 to about amino acid 
305, from about amino acid 307 to about amino acid 310 and from about amino acid 376 to about amino acid 
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379, and amino acid sequence blocks having homology to myelin pO protein from about amino acid 210 to about 
amino acid 239 and from about amino acid 92 to about amino acid 121. Clone DNA68872-I620 has been 
deposited with ATCC on August 25, 1998 and is assigned ATCC deposit no. 203160. 

An analysis of the Dayhoff database (version 35.45 SwissProt 35). using a WU-BLAST2 sequence 
alignment analysis of the full-length sequence shown in Figure 304 (SEQ ID NO:422), evidenced significant 
5 homology between the PR01387 amino acid sequence and the following Dayhoff sequences: P_W36955, 
MYPO_^HETFR, HS46KDA_1. AF049498_i, MY(X)_HUMAN, AF030454^1, A53268, SHPTCRA.l. 
P_W14146 and GEN12838. 

EXAMPLE 138 : Isolation of cDNA clones Encoding Human PR01384 
10 A consensus DNA sequence was assembled relative to other EST sequences using phrap as described 

in Example 1 above. This consensus sequence is herein designated DNA54192. Based on the DNA54192 

sequence, oligonucleotides were synthesized: 1) to identify by PCR a cDNA library that contained the sequence 

of interest, and 2) for use as protes to isolate a clone of the full-length ccKiing sequence for PR01384. 
PCR primers (forward and reverse) were synthesized: 
15 forward PCR primer 5 -TGCAGCCCCTGTGACACAAACTGG-3* (SEQ ID NO:425) 

reverse PCR primer 5*-CTGAGATAACCGAGCCATCCTCCCAC-3' (SEQ ID NO:426) 

Additionally, a synthetic oligonucleotide hybridization probe was constructed from the DNA54192 

sequence which had the following nucleotide sequence: 

hybridization probe 

20 5*-GGAGATAGCTGCTATGGGTTCTTCAGGCACAACTTAACATGGGAAG-3' (SEQ ID NO:427) 

In order to screen several libraries for a source of a full-length clone, DNA from the libraries was 

screened by PCR amplification with the PCR primer pair identified above. A positive library was then used to 

isolate clones encoding the PRO 1384 gene using the probe oligonucleotide and one of the PGR primers. RNA 

for construction of the cDN A libraries was isolated from human fetal liver. 
25 DNA sequencing of the clones isolated as described above gave the full-length DNA sequence for 

PR01384 (designated herein as DNA71 159-1617 (Figure 305, SEQ ID NO:4231; and the derived protein 

sequence for PR01384. 

The entire coding sequence of PR01384 is shown in Figure 305 (SEQ ID NO:423). Clone DNA71 159- 
1617 conuins a single open reading frame with an apparent translational initiation site at nucleotide positions 
30 1 82- 184 and an apparent stop codon at nucleotide positions 869-87 1 . The predicted polypeptide precursor is 229 
amino acids long. The full-length PRO 1384 protein shown in Figure 306 has an estimated molecular weight of 
about 26,650 daltons and a pi of about 8.76. Additional features include a type II transmembrane domain at 
about amino acids 32-57, and potemial N-glycosylation sites at about amino acids 68-71 , 120-123, and 134-137. 

An analysis of die Dayhoff database (version 35.45 SwissProt 35), using a WU-BLAST2 sequence 
35 alignment analysis of die fiill-lengdi sequence shown in Figure 306 (SEQ ID NO:424), revealed homology 
between the PR01384 amino acid sequence and the following Dayhoff sequences: AF054819_1 . HSAJ 1687_1 , 
AF0095 1 1_1 ,AB0107 10.1 , GEN 13595, HSAJ673_1 , GEN 1396 1 , ABOO5900_l ,LECH.CHICK, AP021349.1 , 
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and NK13_RAT. 

Qone DNA7U59-1617 has been deposited with ATCC and is assigned ATCC deposit no. 203135. 

EXAMPLE 139 : Use of PRO as a hybridization probe 

The following method describes use of a nucleotide sequence encoding PRO as a hybridization probe. 
S DNA comprising the coding sequence of full-length or mature PRO as disclosed herein is employed as 

a probe to screen for homologous DN As (such as those encoding namrally-occurring variants of PRO) in human 
tissue cDNA libraries or himian tissue genomic libraries. 

Hybridization and washing of filters containing either library DNAs is performed under the following 
high stringency conditions. Hybridization of radiolabeled PRO-derived probe to the filters is performed in a 
10 solution of 50% formamide, 5x SSC, 0. 1 % SDS, 0. 1 % sodium pyrophosphate. 50 mM sodium phosphate, pH 
6.8, 2x Denhardt's solution, and 10% dextran sulfate at 42**C for 20 hours. Washing of the filters is performed 
in an aqueous solution of 0. Ix SSC and 0. 1 % SDS at 4TC. 

DNAs having a desired sequence identity with the DNA encoding fiilMength native sequence PRO can 
then be identified using standard techniques known in the an. 

15 

EXAMPLE 140: Expression of PRO in E. coli 

This example illustrates preparation of an imglycosylated form of PRO by recombinant expression in 

E. coli. 

The DNA sequence encoding PRO is initially amplified using selected PGR primers. The primers 
20 should contain restriction enzyme sites which correspond to the restriction enzyme sites on the selected 
expression vector. A variety of expression vectors may be employed. An example of a suiuble vector is 
pBR322 (derived from £. coli; see Bolivar et al.. Gene. 2:95 (1977)) which contains genes for ampicillin and 
tetracycline resistance. The vector is digested with restriction enzyme and dephosphorylated. The PGR 
amplified sequences are then ligated into the vector. The vector will preferably include sequences which encode 
25 for an antibiotic resistance gene, a trp promoter, a polyhis leader (including the first six STII codons, polyhis 
sequence, and enterokinase cleavage site), the PRO coding region, lambda transcriptional terminator, and an 
argU gene. 

The ligation mixnire is then used to transform a selected £. coli strain using the methods described in 
Sambrook et al. , supra . Transformants are identified by their ability to grow on LB plates and antibiotic resistant 
30 colonies are dien selected. Plasmid DNA can be isolated and confirmed by restriction analysis and DNA 
sequencing. 

Selected clones can be grown overnight in liquid culture medium such as LB broth supplemented with 
antibiotics. The overnight culmre may subsequently be used to inoculate a larger scale cuinire. The cells are 
then grown to a desired optical density, during which the expression promoter is mmed on. 
35 After culturing the cells for several more hours, the cells can be harvested by centrifugation. The cell 

pellet obtained by the centriiiigation can be solubilized using various agents known in the art, and the solubilized 
PRO protein can then be purified using a metal chelating column under conditions that allow tight binding of the 
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protein. 

PRO may be expressed in £. coU in a poly-His tagged form, using the following procedure. TTje DNA 
encoding PRO is initially amplified using selected PCR primers. The primers will comain restriction enzyme 
sites which correspond to the restriction enzyme sites on the selected expiession veaor. and other useful 
sequences providing for efficient and reliable translation initiation, rapid purification on a metal chelation 
5 column, and proteolytic removal with enterokinase. The PCR-amplified. poly-His tagged sequences are then 
ligated into an expression vector, which is used to transform an £. coU host based on strain 52 (W3110 
fuhA(tonA) Ion galE rpoHts(h,pR.s) clpP(laclq). Transformants are first grown in LB comaining 50 mg/ml 
carbenicillinat30-CwidishakinguntilanO.D.600of3-5isreached. Culmres are then diluted 50-100 fold into 
CRAP media (prepared by mixing 3.57 g (NH,),SO,. 0.71 g sodium citrate.2H20, 1.07 g KCl. 5.36 g Difco 
10 yeast extract. 5.36 g Sheffield hycase SF in 500 mL water, as weU as 1 10 mM MPOS. pH 7.3. 0.55% (w/v) 
glucose and 7 mM MgSOJ and grown for approximately 20-30 hours at 20''C with shaking. Samples are 
removed to verify expression by SDS-PAGE analysis, and the bulk culmre is centrifuged to pellet the cells. Cell 
pellets are frozen until purification and refolding. 

£. coU paste from 0.5 to 1 L fermentations (6-10 g pellets) is resuspended in 10 volumes (w/v) in 7 M 
15 guanidine. 20 mM Tris. pH 8 buffer. Solid sodium sulfite and sodium tetradiionate is added to make fmal 
concentrations of 0. IM and 0.02 M. respectively, and the solution is stirred overnight at 4'C. This step results 
in a denanired protein with all cysteine residues blocked by sulfiiolization. The solution is centrifuged at 40.000 
rpm in a Beckman Ultracentifuge for 30 min. The supernatant is diluted with 3-5 volumes of metal chelate 
column buffer (6 M guanidine. 20 mM Tris. pH 7.4) and filtered through 0.22 micron filters to clarify. The 
20 clarified extract is loaded onto a 5 ml Qiagen Ni-NTA metal chelate column equilibrated in the metal chelate 
column buffer. The column is washed with additional buffer containing 50 mM imidazole (Calbiochem. Utrol 
grade). pH 7.4. The protein is eluted with buffer containing 250 mM imidazole. Fractions containing the 
desired protein are pooled and stored at 4''C. Protein concentration is estimated by its absorbance at 280 mn 
using the calculated extinction coefficient based on its amino acid sequence. 
25 The proteins ate refolded by dUuting the sample slowly into freshly prepared refolding buffer consisting 

of: 20 mM Tris. pH 8.6. 0.3 M NaCl. 2.5 M urea. 5 mM cysteine. 20 mM glycine and 1 mM EDTA. 
Refolding volumes are chosen so that the final protein concentration is between 50 to 100 micrograms/ml. The 
refolding solution is stirred gently at 4-C for 12-36 hours. The refolding reaction is quenched by the addition 
of TFA to a final concentration of 0.4% (pH of approximately 3). Before further purificaUon of the protein, the 
30 solution is filtered through a 0.22 micron filter and acetonitrile is added to 2-10% final concentration. The 
refolded protein is chromaiographed on a Poros Rl/H reversed phase column using a mobUe buffer of 0.1% 
TFA with elution with a gradiem of acetonitrile from 10 to 80%. Aliquots of fractions with A280 absorbance 
are analyzed on SDS polyacrylamide gels and firaaions conuining homogeneous refolded protein an pooled. 
Generally, the properly refolded species of most proteins are eluted at the lowest concentrations of acetonitrile 
35 since those species are the most compact with their hydrophobic interiors shielded from interaction with the 
reversed phase resin. Aggregated species are usually eluted at higher acetonitrile concentrations. In addition 
to resolving misfolded forms of proteins from the desired form, die reversed phase step also removes endotoxin 
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from the samples. 

Fractions containing the desired folded PRO polypeptide are pooled and the acetonitrile removed using 
a gentle stream of nitrogen directed at the solution. Proteins are formulated into 20 mM Hepes, pH 6.8 with 
0. 14 M sodium chloride and 4 % mannitol by dialysis or by gel filtration using G25 Superfine (Pharmacia) resins 
equilibrated in the formulation buffer and sterile fdtered. 
5 Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

EXAMPLE 141: Ex pression of PRO in mammalian cells 

This example illustrates preparation of a potentially glycosylated form of PRO by recombinant 
expression in mammalian cells. 
10 The vector, pRK5 (see HP 307,247, published March 15. 1989), is employed as the expression vector. 

Optionally, die PRO DNA is ligated into pRK5 with selected restriction enzymes to allow insenion of the PRO 
DNA using ligation methods such as described in Sambrooketal., supra . The resulting vector is called pRK5- 
PRO. 

In one embodiment, the selected host cells may be 293 cells. Human 293 cells (ATCC CCL 1573) are 

IS grown to confluence in tissue culhire plates in medium such as DMEM supplemented with fetal calf serum and 
optionally, nutrient components and/or antibiotics. About 10 ^g pRK5-PRO DNA is mixed with about 1 fig 
DNA encoding the VA RNA gene [Thimmappaya et al.. Cell, 31-343 (1982)] and dissolved in 500 /xl of 1 mM 
Tris-HCI, 0. 1 mM EDTA. 0.227 M CaCl,. To diis mixnire is added, dropwise, 500 ^1 of 50 mM HEPES (pH 
7.35), 280 mM NaCl, 1.5 mM NaP04. and a precipitate is allowed to form for 10 minutes at 25*^0. The 

20 precipitate is suspended and added to die 293 cells and allowed to settle for about four hours at 37*C The 
culture medium is aspirated off and 2 ml of 20% glycerol in PBS is added for 30 seconds. The 293 cells are 
then washed with serum free medium, fresh medium is added and the cells are incubated for about 5 days. 

Approximately 24 hours after die transfections, die culture medium is removed and replaced widi culture 
medium (alone) or culture medium containing 200 ^Ci/ml ^S-cysteine and 200 /xCi/ml ^^S-mediionine. After 

25 a 12 hour incubation, the conditioned medium is collected, concentrated on a spin filter, and loaded onto a 15% 
SDS gel. The processed gel may be dried and exposed to film for a selected period of time to reveal die 
presence of PRO polypeptide. The cultures containing transfected cells may undergo further incubation (in 
serum free medium) and the medium is tested in selected bioassays. 

In an alternative technique, PRO may be introduced into 293 cells transiendy using die dextran sulfate 

30 mediod described by Somparyrac et al., Proc. Natl. Acad. Sci. . 12:7575 (1981). 293 cells are grown to 
maximal density in a spinner flask and 700 ^g pRK5-PR0 DNA is added. The cells are first concemrated from 
die spinner flask by centrifugation and washed widi PBS. The DNA-dextran precipitate is incubated on die cell 
pellet for four hours. The cells are treated widi 20% glycerol for 90 seconds, washed widi tissue culture 
medium, and re-introduced into die spinner flask containing tissue culture medium, 5 ftgfmi bovine insulin and 

35 0. 1 /ig/ml bovine transferrin. After about four days, the conditioned media is centrifiiged and filtered to remove 
cells and debris. The sample containing expressed PRO can dien be concentrated and purified by any selected 
method, such as dialysis and/or colwnn chromatography. 



490 



wo 99/63088 



PCTAJS99/12252 



In another embodiment, PRO can be expressed in CHO cells. The pRK5-PRO can be transfected into 
CHO cells using known reagents such as CaPO* or DEAE-dexiran. As described above, the cell cultures can 
be incubated, and the medium replaced with culture medium (alone) or medium containing a radiolabel such as 
^S-meihionine. After determining the presence of PRO polypeptide, the culture medium may be replaced with 
serum free medium. Preferably, the cultures are incubated for about 6 days, and then the conditioned medimn 
5 is harvested. The medium containing the expressed PRO can then be concentrated and purified by any selected 
method. 

Epitope-tagged PRO may also be expressed in host CHO cells. The PRO may be subcloned out of the 
pRK5 vector. The subclone insert can undergo PCR to fuse in frame with a selected epitope tag such as a poly- 
his tag into a Baculovinis expression vector. The poly-his tagged PRO insen can then be subcloned into a SV40 

10 driven vector containing a selection marker such as DHFR for selection of suble clones. Finally, the CHO cells 
can be transfected (as described above) with the SV40 driven vector. Labeling may be performed, as described 
above, to verify expression. The culnire medium containing the expressed poly-His tagged PRO can then be 
concentrated and purified by any selected method, such as by Ni-'^-chelate affinity chromatography. 

PRO may also be expressed in CHO and/or COS cells by a transient expression procedure or m CHO 

15 cells by anodier stable expression* procedure.. 

Stable expression in CHO cells is performed using the following procedure. The proteins are expressed 
as an IgG construct (immunoadhesin), in which the coding sequences for the soluble forms (e.g. extracellular 
domains) of the respective proteins are fused to an IgGl constant region sequence containing the hinge. CH2 and 
CH2 domains and/or is a poly-His tagged form. 

20 Following PCR amplification, the respective DNAs are subcloned in a CHO expression vector using 

standard techniques as described in Ausubel et al.. Current Proiocols of Molecular Biology. Unit 3.16, John 
Wiley and Sons (1997). CHO expression vectors are constructed to have compatible restriction sites 5' and 3' 
of the DNA of interest tp allow the convenient shuttling of cDNA*s. The vector used expression in CHO cells 
is as described in Lucas et al., Nucl. Acids Res. 24:9 (1774-1779 (1996), and uses the SV40 early 

25 promoter/enhancer to drive expression of the cDNA of interest and dihydrofolate reductase (DHFR). DHFR 
expression permits selection for stable maintenance of the plasmid following transfection. 

Twelve micrograms of the desired plasmid DNA is inuroduced imo approximately 10 million CHO cells 
using commercially available transfection reagents Superfect* (Quiagen), Dosper* or Fugene* (Boehringer 
Mannheim). The cells are grown as described in Lucas et al., supra . Approximately 3 x 10*^ cells are frozen 

30 in an ampule for further growth and production as described below. 

The ampules containing the plasmid DNA are thawed by placement into water badi and mixed by 
vonexing. The contents are pipetted into a centrifuge mbe containing 10 mLs of media and centrifuged at 1000 
rpm for 5 minutes. The supernatant is aspirated and the cells are resuspended in 10 mL of selective media (0.2 
fim filtered PS20 with 5% 0.2 //m diafiltered fetal bovine serum). The cells are then aliquoted into a 100 mL 

35 spinner containing 90 mL of selective media. After 1-2 days, the cells are transferred into a 250 mL spinner 
filled with 150 mL selective growth medium and incubated at 37?C. After another 2-3 days, 250 mL. 500 mL 
and 2(X)0 mL spinners are seeded widi 3 x 10^ cells/mL. The cell media is exchanged with fresh media by 
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centriftigation and resuspension in production medium. Although any suitable CHO media may be employed, 
a production medium described in U.S. Patent No. 5,122,469, issued June 16, 1992 may acmally be used. A 
3L producuon spinner is seeded at 1 .2 x 10^ cells/mL. On day 0, the cell number pH ie determined. On day 
1, the spinner is sampled and sparging with filtered air is commenced. On day 2, the spinner is sampled, the 
temperanire shifted to 33°C, and 30 mL of 500 g/L glucose and 0.6 mL of 10% antifoam (e.g., 35% 
5 polydimethylsiloxane emulsion, Dow Coming 365 Medical Grade Emulsion) taken. Throughout the production, 
the pH is adjusted as necessary to keep it at around 7.2. After 10 days, or until the viability dropped below 
70% , the cell culmre is harvested by centriftigation and filtering through a 0.22 ^m filter. The filtrate was either 
stored at 4**C or immediately loaded onto columns for purification. 

For the poly-His tagged constructs, the proteins are purified using a Ni-NTA column (Qiagen). Before 

10 purification, imidazole is added to the conditioned ihedia to a concentration of 5 mM. The conditioned media 
is pumped onto a 6 ml Ni-NTA column equilibrated in 20 mM Hepes, pH 7.4. buffer containing 0.3 M NaCI 
and 5 mM imidazole at a flow rate of 4-5 ml/min. at 4X. After loading, the column is washed with additional 
equilibration buffer and the protein eluted widi equilibration buffer containing 0.25 M imidazole. The highly 
purified protein is subsequently desalted into a storage bu^er containing 10 mM Hepes, 0. 14 M NaCl and 4% 

15 mannitol, pH 6.8, with a 25 ml G25 Superfine (Pharmacia) column and stored at -80°C. 

Immunoadhesin (Fc-containing) constructs are purified from the conditioned media as follows. The 
conditioned medium is pumped onto a 5 ml Protein A column (Pharmacia) which had been equilibrated in 20 
mM Na phosphate buffer. pH 6.8. After loading, the column is washed extensively with equilibration buffer 
before elution with 100 mM citric acid, pH 3.5. The eluted protein is immediately neutralized by collecting 1 

20 ml ftactions into tubes containing 275 of 1 M Tris buffer, pH 9. The highly piu-ified protein is subsequently 
desalted into storage buffer as described above for the poly-His ugged proteins. The homogeneity is assessed 
by SDS polyacrylamide gels and by N-ierminal amino acid sequencing by Edman degradation. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 

25 EXAMPLE 142: Expression of PRO in Yeast 

The following method describes recombinant expression of PRO in yeast. 

First, yeast expression vectors are constructed for intracellular production or secretion of PRO from 
the ADH2/GAPDH promoter. DN A encoding PRO and die promoter is inserted into suitable restriction enzyme 
sites in the selected plasmid to direct intracellular expression of PRO. For secretion, DNA encoding PRO can 

30 be cloned into the selected plasmid, together widi DNA encoding the ADH2/GAPDH promoter, a native PRO 
signal peptide or other mammalian signal peptide, or, for example, a yeast alpha-factor or invertase secretory 
signal/leader sequence, and linker sequences (if needed) for expression of PRO. 

Yeast cells, such as yeast strain AB 1 10, can then be transformed with the expression plasmids described 
above and culmred in selected fermentation media. The transformed yeast supematants can be analyzed by 

35 precipiution with 10% trichloroacetic acid and separation by SDS-PAGE, followed by staining of the gels with 
Coomassie Blue stain. 
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Recombinant PRO can subsequently be isolated and purified by removing the yeast ceils from the 
fermentation medium by centrifiigation and then concentrating the medium using selected cartridge filters. The 
concentrate containing PRO may further be purified using selected column chromatography resins. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 



5 EXAMPLE 143 : Expression of PRO in Baculovinis-Infected Insect Cells 

The following method describes recombinant expression of PRO in Baculovirus-infecied insect cells. 
The sequence coding for PRO is fused upstream of an epitope tag contained within a bacuiovirus 
e;q)ression vector. Such epitope tags include poly*his tags and immunoglobulin tags (like Fc regions of IgG). 
A variety of plasmids may be employed, including plasmids derived from commercially available plasmids such 

10 as pVL1393 (Novagen). Briefly, the sequence encoding PRO or the desired ponion of the coding sequence of 
PRO such as the sequence encoding the extracellular domain of a transmembrane protein or the sequence 
encoding the mature protein if the protein is extracellular is amplified by PCR with primers complementary to 
the 5* and 3' regions. The 5* primer may incorporate flanking (selected) restriction enzyme sites. The product 
is then digested with those selected restriction enzymes and subcloned into the expression vector. 

15 Recombinant bacuiovirus' is generated by co-transfeciing the above piasmid and BaculoGold*^ virus 

DNA (Pharmingen) into Spodoptera frugiperda (*'Sf9'') cells (ATCC CRL 171 1) using lipofectin (commercially 
available from GIBCO-BRL). After 4-5 days of incubation at 28°C, the released viruses are harvested and used 
for further amplifications. Viral infection and protein expression are performed as described by 0*Reilley et 
al., Bacuiovirus expression vectors: A Laboratory Manual. Oxford: Oxford University Press (1994). 

20 Expressed poly-his tagged PRO can then be purified, for example, by Ni-'^-chelaie affinity 

chromatography as follows. Extracts are prepared from recombinant virus- infected Sf9 cells as described by 
Rupen et al.. Nature . 362 : 175-179 (1993). Briefly, Sf9 cells are washed, resuspended in sonication buffer (25 
mL Hepes. pH 7.9; 12.5 mM MgClj; 0. 1 mM EDTA; 10% glycerol; 0. 1 % NP.40; 0.4 M KCl), and sonicated 
twice for 20 seconds on ice. The sonicates are cleared by centrifugation, and the supernatant is diluted 50-fold 

25 in loading buffer (50 mM phosphate, 300 mM NaCl, 10% glycerol, pH 7.8) and filtered dirough a 0.45 ^m 
filter. A Ni-^-NTA agarose column (conunercially available from Qiagen) is prepared with a bed volume of 5 
mL, washed with 25 mL of water and equilibrated with 25 mL of loading buffer. The filtered cell extract is 
loaded onto the colunm at 0.5 mL per minute. The column is washed to baseline A280 loading buffer, at 
which point fraction collection is started. Next, the column is washed with a secondary wash buffer (50 mM 

30 phosphate; 300 mM NaCl, 10% glycerol, pH 6.0), which elutes nonspecifically bound protein. After reaching 
Aaao baseline again, the column is developed with a 0 to 500 mM Imidazole gradient in the secondary wash 
buffer. One mL fractions are collected and analyzed by SDS-PAGE and silver staining or Western blot with 
Ni^'*'-NTA-conjugated to alkaline phosphatase (Qiagen). Fractions containing the eluted HiSio-tagged PRO are 
pooled and dialyzed against loading buffer. 

35 Alternatively, purification of the IgG tagged (or Fc tagged) PRO can be performed using known 

chromatography techniques, including for instance. Protein A or protein G column chromatography. 

Many of the PRO polypeptides disclosed herein were successfully expressed as described above. 
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EXAMPLE 144: Preparation of Antibodies that Bind PRO 

This example illustrates preparation of monoclonal antibodies which can specifically bind PRO. 
Techniques for producing die monoclonal antibodies are known in the an and are described, for 
instance, in Coding, suDra > Immunogens that may be employed include purified PRO. fusion proteins containing 
PRO, and cells expressing recombinant PRO on the cell surface. Selection of the immunogen can be made by 
5 the skilled anisan without undue experimentation. 

Mice, such as Balb/c, are immunized with the PRO immunogen emulsified in complete Freund's 
adjuvant and injected subcutaneously or intraperitoneally in an amount from 1-100 micrograms. Alternatively, 
the immunogen is emulsified in MPL-TDM adjuvant G^ibi Inununochemical Research. Hamilton. MT) and 
injected into the animal's hind foot pads. The inununized mice are then boosted 10 to 12 days later with 
10 additional immunogen emulsified in the selected adjuvant. Thereafter, for several weeks, the mice may also be 
boosted with adduional immunization injections. Serum samples may be periodically obtained from the mice 
by retro-orbital bleeding for testing in ELISA assays to detect anti-PRO antibodies. 

After a suitable antibody titer has been detected, the animals "positive" for antibodies can be injected 
with a final intravenous injection of PRO. Three to four days later, the mice are sacrificed and the spleen cells 
15 are harvested. The spleen cells are dien fijsed (using 35% polyethylene glycol) to a selected murine myeloma 
cell line such as P3X63AgU.l, available from ATCC, No. CRL 1597. The fusions generate hybridoma cells 
which can then be plated in 96 well tissue culture plates containing HAT (hypoxanihine, aminopterin, and 
thymidine) medium to inhibit proliferation of non-fiised cells, myeloma hybrids, and spleen cell hybrids. 

The hybridoma cells will be screened in an ELISA for reactivity against PRO. Determmation of 
20 "positive" hybridoma cells secreting the desired monoclonal antibodies against PRO is within the skill in the an. 

The positive hybridoma cells can be injected intraperitoneally into syngeneic Balb/c mice to produce 
ascites containing the anti-PRO monoclonal antibodies. Aliemaiively , the hybridoma cells can be grown in tissue 
culture flasks or roller bottles. Purification of the monoclonal antibodies produced in the ascites can be 
accomplished using anmioniimi sulfate precipitation, followed by gel exclusion chromatography. Aliemaiively, 
25 affinity chromatography based upon binding of antibody to protein A or protein G can be employed. 

EXAMPLE 145: Purification of PRO Polypeptides Using Specific Antibodies 

Native or recombinant PRO polypeptides may be purified by a variety of standard techniques in die an 

of protein purification. For example, pro-PRO polypeptide, mature PRO polypeptide, or prc-PRO polypeptide 
30 is purified by inomunoaffinity chromatography using amibodies specific for the PRO polypeptide of interest. In 

general, an immtmoaffinity column is constructed by covalendy coupling the anti-PRO polypeptide amibody to 

an activated chromatogr^hic resin. 

Polyclonal insmunoglobulins are prepared from immune sera either by precipitation with ammonium 

sulfate or by purification on inmiobilized Protein A (Pharmacia LKB Biotechnology, Piscataway, NJ.). 
35 Likewise, monoclonal antibodies are prepared from mouse ascites fiuid by anmionium sulfate precipitation or 

chroniatogrq>hy on inomobilized Protein A. Partially purified immunoglobulin is covalently attached to a 

chromatographic resin such as CnBr-activated SEPHAROSE^ (Pharmacia LKB Biotechnology). The antibody 
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is coupled to the resin, the resin is blocked, and the derivative resin is washed according to the manufacnirer's 
instniaions. 

Sudi an immunoaffinity column is utilized in the purification of PRO polypeptide by preparing a fraction 
from cells containing PRO polyj^ptide in a soluble form. This preparation is derived by solubilization of the 
whole cell or of a subcellular fraction obtained via differential cenirifugation by the addition of detergent or by 
5 other methods well known in the art. Alternatively, soluble PRO polypeptide containing a signal sequence may 
be secreted in useful quantity into the medium in which the cells are grown. 

A soluble PRO polypeptide-coniaining preparation is passed over the immunoaffinity column, and the 
colimm is washed under conditions that allow the preferential absorbance of PRO polypeptide (e.g. , high ionic 
strength buffers in the presence of detergent). Then, the column is eluted under conditions that disrupt 
10 aniibody/PRO polypeptide binding (e.g. . a low pH buffer such as approximately pH 2-3, or a high concentration 
of a chaoirope such as urea or thiocyanate ion), and PRO polypeptide is collected. 

EXAMPLE 146: Drue Screening 

This invention is particularly useful for screening compounds by using PRO polypeptides or binding 

15 fragment thereof in any of a variety of drug screening techniques. The PRO polypeptide or fragment employed 
in such a test may either be free in solution, affixed to a solid suppon, borne on a cell surface, or located 
intracellularly. One method of drug screenmg utilizes eukaryotic or prokaryotic host cells which arc stably 
transformed with recombmant nucleic acids expressing the PRO polypeptide or fragment. Drugs are screened 
against such transformed cells in compethive binding assays. Such cells, either in viable or fixed form, can be 

20 used for standard binding assays. One may measure, for example, the formation of complexes between PRO 
polypeptide or a fragment and the agent being tested. Alternatively, one can examine the diminution in complex 
formation between the PRO polypeptide and its target cell or target receptors caused by the agent being tested. 

Thus, the present invention provides methods of screening for drugs or any other agents which can 
affect a PRO polypepiide-associaied disease or disorder. These methods comprise contacting such an agent with 

25 an PRO polypeptide or fragment thereof and assaying (I) for the presence of a complex between the agent and 
the PRO polypeptide or fragment, or (ii) for the presence of a complex between the PRO polypeptide or fragment 
and the cell, by methods well known in the art. In such competitive binding assays, the PRO polypeptide or 
fragment is typically labeled. After suitable incubation, free PRO polypeptide or fragment is separated from that 
present in bound form, and the amount of free or uncomplexed label is a measure of the ability of the particular 

30 agent to bind to PRO polypeptide or to interfere with the PRO polypeptide/cell complex. 

Another technique for drug screening provides high throughput screenmg for compounds having suitable 
binding affinity to a polypeptide and is described in detail in WO 84/03564, published on September 13, 1984. 
Briefly stated, large numbers of different small peptide test compoimds are synthesized on a solid substrate, such 
as plastic pins or some other surface. As applied to a PRO polypeptide, the peptide test compounds are reacted 

35 with PRO polypeptide and washed. Bound PRO polypeptide is detected by methods well known in the art. 
Purified PRO polypeptide can also be coated directly onto plates for use in the aforementioned drug screening 
techniques. In addition, non-neutralizing antibodies can be used to capture the peptide and immobilize it on the 
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solid suppon. 

This invention also contemplates the use of competitive drug screening assays in which neutralizing 
antibodies capable of binding PRO polypeptide specifically compete with a test compound for binding to PRO 
polypeptide or fragments thereof. In this manner, the antibodies can be used to detect the presence of any 
pepude which shares one or more antigenic determinants with PRO polypeptide. 

5 

EXAMPLE 147: Rational Drug Design 

The goal of rational drug design is to produce structural analogs of biologically active polypeptide of 
interest {Le., a PRO polypeptide) or of small molecules with which they interact, e.g. , agonists, antagonists, or 
inhibitors. Any of these examples can be used to fashion drugs which are more active or stable forms of the 

1 0 PRO polypeptide or which enhance or interfere with the function of the PRO polypeptide in vivo (c.f. . Hodgson, 
Bio/Technology. 9: 19-21 (1991)). 

In one approach, the three-dimensional structure of the PRO polypeptide, or of an PRO 
polypeptide-inhibitor complex, is determined by x-ray crystallography, by computer modeling or, most typically, 
by a combination of the two approaches. Both the shape and charges of the PRO polypeptide must be ascenained 

15 to elucidate the structure and to determine active site(s) of the molecule. Less often, useful information regarding 
the structure of the PRO polypeptide may be gained by modeling based on the stnicmre of homologous proteins. 
In both cases, relevant structural information is used to design analogous PRO polypeptide-like molecules or to 
identify efficient inhibitors. Useful examples of rational drug design may include molecules which have improved 
activity or subility as shown by Braxton and Wells, Biochemistry. 31:7796-7801 (1992) or which act as 

20 inhibitors, agonists, or antagonists of native peptides as shown by Athauda et aL, J. Biochem.. Jii: 742-746 
(1993). 

It is also possible to isolate a target-specific antibody, selected by functional assay, as described above, 
and then to solve its crystal structure. This approach, in principle, yields a pharmacore upon which subsequent 
drug design can be based. It is possible to bypass protein crystallography altogether by generating anii-idioiypic 

25 antibodies (anti-ids) to a functional, pharmacologically active antibody. As a mirror image of a mirror image, 
the binding site of the anti-ids would be expected to be an analog of the original receptor. The anti-id could then 
be used to identify and isolate peptides from banks of chemically or biologically produced peptides. The isolated 
peptides would then act as the pharmacore. 

By virtue of the present invention, sufficient amounts of the PRO polypeptide may be made available 

30 to perform such analytical smdies as X-ray crystallography. In addition, knowledge of the PRO polypeptide 
amino acid sequence provided herein will provide guidance to those employing computer modeling techniques 
in place of or in addition to x-ray crystallography. 

35 
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Deposit of Material 

The following materials have been deposited with the American Type Culture Collection, 10801 
University Blvd.. Manassas. VA 20110-2209. USA (ATCC): 



Table 2 



10 



15 



20 



25 



30 



35 



40 



45 



50 



Material 


ATCC Deo. No. 


Deoosit E 


DNA 16422- 1209 


209929 


June 2, 1998 


DNA 16435- 1208 


209930 


June 2. 1998 


DNA21624-1391 


209917 


June 2. 1998 


DNA23334-1392 


209918 


June 2, 1998 


DNA26288-1239 


209792 


AprU21, 1998 


DNA26843-1389 


203099 


August 4, 1998 


DNA26844-1394 


209926 


June 2, 1998 


DNA30862-1396 


209920 


June 2, 1998 


DNA35680-1212 


209790 


April 21, 1998 


DNA4062M440 


209922 


June 2. 1998 


DNA4416M434 


209907 


May 27, 1998 


DNA44694-1500 


203114 


August 11, 1998 


DNA45495-1550 


203156 


August 25» 1998 


DNA47361-1154 


209431 


November 7, 1997 


DNA47394.1572 


203109 


August lU 1998 


DNA48320-1433 


209904 


May 27, 1998 


DNA48334-1435 


209924 


June 2. 1998 


DNA48606-1479 


203040 


July 1. 1998 


DNA49141-1431 


203003 


June 23, 1998 


DNA49142-1430 


203002 


June 23, 1998 


DNA49143-1429 


203013 


June 23, 1998 


DNA49647.1398 


209919 


June 2, 1998 


DNA49819-1439 


209931 


June 2, 1998 


DNA49820-1427 


209932 


June 2. 1998 


DNA49821-1562 


209981 


June 16, 1998 


DNA52 192-1369 


203042 


July 1. 1998 


DNA52598-1518 


203107 


August 11, 1998 


DNA539 13-1490 


203162 


August 25, 1998 


DNA53978-1443 


209983 


June 16, 1998 


DNA53996-1442 


209921 


June 2, 1998 


DNA5o041-1416 


203012 


June 23, 1998 


DNA56047-1456 


209948 


June 9, 1998 


DNA56050-1455 


203011 


June 23. 1998 


DNAS61 10-1437 


203113 


August 11, 1998 


UINA301 i3-i37o 


203049 


July 1, 1998 


DNA56410-1414 


209923 


June 2, 1998 


DNA56436-1448 


209902 


May 27, 1998 


DNA56855-1447 


203004 


June 23, 1998 


DNA56859-1445 


203019 


June 23, 1998 


DNA56860-1510 


209952 


June 9. 1998 


DNA56865-1491 


203022 


June 23, 1998 


DNAS6866-1342 


203023 


June 23, 1998 


DNA56868-1209 


203024 . 


June 23, 1998 


DNA56869.1545 


203161 


August 25, 1998 


DNA56870-1492 


209925 


June 2, 1998 


DNA57033.1403 


209905 


May 27. 1998 


DNA57037-1444 


209903 


May 27. 1998 


DNA57129-1413 


209977 


June 16. 1998 
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10 



15 



20 



25 



30 



35 



40 



DNA57690-1374 




Jimp 0 lOQft 

JUIIC 7f 1770 


DNA57693-1424 


203008 


June 23 1998 

J Hub 1770 


DNA57694-1341 


203017 


June 23 1998 


DNA57695-1340 


203006 


June 23 1998 


DNA57699-1412 


203020 


June 23 1998 


DNA57702-I476 


209951 


June 9 1998 

■flulw 7« 1 770 


DNA57704-1452 


209953 


June 9 1998 


DNA57708-1411 


203021 


June 23 1998 

JUUfr X770 


DNA5T710-1451 


203048 


Julv 1 1998 

* Uijr If 1 770 


DNA5771 1-1501 


203047 


Julv 1 1998 


DNA57827.1493 


203045 


Julv 1 1998 


DNA57834.1339 


209954 


June 9 1998 

7, 1 770 


DNA57836-1338 


203025 


June 23 1998 

JUUw ^v'f 1770 


DNA57838-1337 


203014 


Juni^ T\ lOOR 

JUUC l770 


DNA57844-1410 


203010 




DNA58721-1475 


203110 


Aiiffiicr 1 1 1QQS 

/MJgUol 11, l770 


DNA58723-1588 


203133 


/\llgU5l lOf l770 


DNA58737-1473 


203136 


AUgUol to, l770 


DNA58743-1609 ^ 


203154 


/\UglMl ^H^, 1770 


DNA58846-1409 


209957 


limp Q IQOft 
JUUC 7» 1770 


DNA58848-1472 


209955 


limp 0 IQQft 

JUUC 7, l770 


DNA5S849-i494 


209058 


JUnC 7, l770 


DNA58850-1495 


209956 


JUIlC 7, l770 


DNA58853-1423 




June zj, 1770 




')(YK(\ 1 ft 
jU 1 O 


June zj, I Wo 


DNA59205-1421 


203009 


limp 7'% lOOfi 
June ^j, 1770 


DNA59211-1450 


209960 


limp 0 10Q& 
June 7, i77o 


DNA59213-1487 


^V/77J7 


limp 0 100S 

June 7, l770 


DNA59214-1449 


203046 


Iiiiv 1 lOQfi 

July If 1770 


DNA592 15-1425 


209961 


limp 0 IQOft 
junc 7, i770 


DNA59220-1514 


209962 


limp 0 lOQft 
June 7, I77O 


DNA59488-1603 




/MlgUdl '•^^t I77O 


DNA59493-1420 


203050 


Iiilv \ lOQft 
JUiy 1, i77o 


DNA59497-1496 


209941 

*»W77*T i 


June 1770 


DNA59588-1571 


203106 


/\UgU5l 1 1 « i 770 


DNA59603-14I9 


209944 


limp 0 

JUlIw 7, 1770 


DNA59605-1418 


203005 


limp 7'^ 10QR 
June 1770 


DNA59606-1471 


209945 

*»W77^J 


limp 0 lOOR 
junL 7, 1770 


DNA59607-1497 


209957 


limp 0 

JUllw 7, 1770 


DNA59609-1470 


209963 


Jimp 9 10QK 

JUUC 7, 1770 


DNA59610-1559 


209990 


limp 1^ 109R 

JUiiC IV/, X770 


DNA59612-1466 


209947 


June 9 1 99ft 

JUItW 7f 1770 


DNA596i3-14l7 


203007 


June ^73 1998 


DNA59616-1465 


209991 


June 16 1998 


DNA59619-1464 


203041 


Julv 1 1998 


DNA59620-1463 


209989 


June 16 1998 


DNA59625-1498 


209992 


June 17 1998 


DNA59767-1489 


203108 


Auffust 11 1998 


DNA59776-1600 


203128 


/\U^Udi iO, A770 


DNA59777-1480 


203111 


Au9ti<ef 1 1 1998 

/>U^U3l 11} 1 770 


DNA59820-1549 


203129 


August 18, 1998 


DNA59827-1426 


203089 . 


August 4, 1998 


DN>VS9828-1608 


203158 


August 23, 1998 


DNA59838.1462 


209976 


June 16, 1998 


DNA59839-1461 


209988 


June 16. 1998 


DNA59841.1460 


203044 


July 1, 1998 


DNAS9842-1502 


209982 


June 16. 1998 
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DNA59846-1503 


209978 


June 16, 1998 


DNA59847-15U 


203098 


August 4. 1998 


DNA59848-1512 


203088 


August 4, 1998 


DNA59849-1504 


209986 


June 16, 1998 


DNA59853-1505 


209985 


June 16, 1998 


DNA59854-1459 


209974 


June 16, 1998 


DNA60283*1484 


203043 


July 1, 1998 


DNA6061S-1483 


2099^ 


June 16 1998 


DNA60619-1482 


209993 


June 16, 1998 


DNA60621-1516 


203091 


August 4, 1998 


DNA60622-1525 


203090 


Aueust 4 1998 


DNA60625-1507 


209975 


June 16 1998 


DNA60627-1508 


203092 


Aumst 4 1998 


DNA60629-1481 


209979 


June 16 1998 


DNA6 1755- 1554 


203112 


August 1 1 1998 


DNA6 1873- 1574 


203132 


August 18 1998 


DNA62814-1521 


203093 


Aurust 4 1998 


DNA62872-1509 


203100 


Au0mt 4 1998 

i»UgUOi 1770 


DNA62876-1517 


203095 


August 4 1998 


DNA62881-1515 


203096 


Aumst 4 1998 


DNA64852-1589 


203127 


Aupmt 18 1998 


DNA64884-1527 


203155 


August 25 1998 

^\UKUSi ^ A 770 


DNA64890-1612 


203131 


Auouct 18 1998 

<%UgUdl lO, A 770 


DNA65412-1523 


203094 


August 4, 1998 


DNA66308-1537 


203159 


August 25. 1998 


DNA66309-1538 


203235 


September 15, 1998 


DNA67004-1614 


203115 


August 11, 1998 


DNA68869-1610 


203164 


August 25, 1998 


DNA68872-1620 


203160 


August 25, 1998 


DNA71 159.1617 


203135 


August 18, 1998 



30 

These deposit were made under the provisions of the Budapest Treaty on the International Recognition 
of the Deposit of Microorganisms for the Purpose of Patent Procedure and the Regulations thereunder (Budapest 
Treaty). This assures maintenance of a viable culture of the deposit for 30 years from the date of deposit. The 
deposits will be made available by ATCC under the terms of the Budapest Treaty, and subject to an agreement 

35 between Genemech, Inc. and ATCC, which assures permanent and unrestricted availability of the progeny of 
the culture of the deposit to the public upon issuance of the pertinent U.S. patent or upon laying open to the 
public of any U.S. or foreign patent application, whichever comes first, and assures availability of the progeny 
to one determined by the U.S. Commissioner of Patents and Trademarks to be entitled thereto according to 35 
use §122 and the Commissioner's rules pursuant thereto (including 37 CFR §1.14 with particular reference to 

40 886 OG 638). 

The assignee of the present application has agreed that if a culmre of the materials on deposit should 
die or be lost or destroyed when cultivated luider suitable conditions, the materials will be prompdy replaced on 
notification with another of the same. Availability of the deposited material is not to be construed as a license 
to practice the invention in contravention of the rights granted under the authority of any government in 
45 accordance with its patent laws. 

The foregoing written specification is considered to be sufficiem to enable one skilled in the art to 
practice the invention. The present invention is not to be limited in scope by the construct deposited, since the 
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deposited embodiment is intended as a single illustration of certain aspects of the invention and any constructs 
thai are functionally equivalent are within the scope of this invention. The deposit of material herein does not 
constitute an admission that the written description herein contained is inadequate to enable the practice of any 
aspect of the invention, including the best mcKie thereof, nor is it to be construed as limiting the scope of the 
claims to the specific illustrations that it represents. Indeed, various modifications of the invention in addition 
to those shown and described herein will become apparent to those skilled in the an from the foregoing 
description and fall within the scope of the appended claims. 
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WHAT IS CLAIMED ISr 

1. Isolated nucleic acid having at least 80% sequence identity to a nucleotide sequence that 
encodes a polypeptide comprising an amino acid sequence selected from the group consisting of the amino acid 
sequence shown in Figure 2 (SEQ ID N0:2). Figure 4 (SEQ ID NO:6), Figure 6 (SEQ ID N0:8), Figure 9 
(SEQ ID NO: 14). Figure 12 (SEQ ID NO:20). Figure 15 (SEQ ID NO:23). Figure 18 (SEQ ID NO:28), Figure 
5 20 (SEQ ID NO:30), Figure 23 (SEQ ID NO:33). Figure 25 (SEQ ID NO:36). Figure 27 (SEQ ID N0:4I). 
Figure 30 (SEQ ID NO:47), Figure 32 (SEQ ID NO:52). Figure 34 (SEQ ID NO:57). Figure 36 (SEQ ID 
NO:62). Figure 38 (SEQ ID NO:67). Figure 41 (SEQ ID NO:73). Figure 47 (SEQ ID NO:84), Figure 49 (SEQ 
ID NO:95). Figure 51 (SEQ ID NO:97), Figure 53 (SEQ ID NO:99). Figure 57 (SEQ ID NO: 103). Figure 64 
(SEQ ID NO: 113). Figure 66 (SEQ ID NO: 1 15), Figure 68 (SEQ ID NO: 117). Figure 70 (SEQ ID NO: 1 19). 
10 Figure 72 (SEQ ID NO: 124). Figure 74 (SEQ ID NO: 129). Figure 76 (SEQ ID NO: 135). Figure 79 (SEQ ID 
NO: 138). Figure 83 (SEQ ID NO: 146). Figure 85 (SEQ ID NO: 148). Figure 88 (SEQ ID NO: 151). Figure 90 
(SEQ ID NO: 153). Figure 93 (SEQ ID NO: 156). Figure 95 (SEQ ID NO: 158). Figure 97 (SEQ ID NO: 160). 
Figure 99 (SEQ ID NO: 165). Figure 101 (SEQ ID NO: 167). Figure 103 (SEQ ID NO: 169). Figure 105 (SEQ 
ID NO: 171). Figure 109 (SEQ ID NO: 175). Figure 111 (SEQ ID NO: 177). Figure 113 (SEQ ID NO: 179). 
15 Figure 115 (SEQ ID NO: 181). Figure 117 (SEQ ID NO: 183). Figure 120 (SEQ ID NO: 189). Figure 122 (SEQ 
ID NO:194), Figure 125 (SEQ ID NO:197). Figure 127 (SEQ ID NO: 199). Figure 129 (SEQ ID NO:201). 
Figure 131 (SEQ ID NO:203). Figure 133 (SEQ ID NO:205). Figure 135 (SEQ ID NO:207). Figure 137 (SEQ 
ID NO:209), Figure 139 (SEQ ID N0:211). Figure 141 (SEQ ID NO:213). Figure 144 (SEQ ID NO:216). 
Figure 147 (SEQ ID NO:219). Figure 149 (SEQ ID NO:221). Figure 151 (SEQ ID NO:223). Figure 153 (SEQ 
20 ID NO:225). Figure 155 (SEQ ID NO:227), Figure 157 (SEQ ID NO:229). Figure 159 (SEQ ID NO:231), 
Figure 161 (SEQ ID NO:236). Figure 163 (SEQ ID NO:241). Figure 165 (SEQ ID NO:246). Figure 167 (SEQ 
ID NO:248). Figure 169 (SEQ ID NO:250), Figure 171 (SEQ ID NO:253). Figure 174 (SEQ ID NO:256). 
Figure 176 (SEQ ID NO:258). Figure 178 (SEQ ID NO:260). Figure 180 (SEQ ID NO:262), Figure 182 (SEQ 
ID NO:264). Figure 184 (SEQ ID NO:266). Figure 186 (SEQ ID NO:268). Figure 188 (SEQ ID NO:270), 
25 Figure 190 (SEQ ID NO:272), Figure 192 (SEQ ID NO:274). Figure 194 (SEQ ID NO:276). Figure 196 (SEQ 
ID NO:278). Figure 198 (SEQ ID NO:281). Figure 200 (SEQ ID NO:283). Figure 202 (SEQ ID NO:285), 
Figure 204 (SEQ ID NO:287). Figure 206 (SEQ ID NO:289). Figure 208 (SEQ ID NO:291). Figure 210 (SEQ 
ID NO:293). Figure 212 (SEQ ID NO:295). Figure 214 (SEQ ID NO:297). Figure 216 (SEQ ID NO:299). 
Figure 218 (SEQ ID NO:301), Figure 220 (SEQ ID NO:303). Figure 226 (SEQ ID NO:309). Figure 228 (SEQ 
30 ID NO:314). Figure 230 (SEQ ID NO:319). Figure 233 (SEQ ID NO:326). Figure 235 (SEQ ID NO:334). 
Figure 238 (SEQ ID NO:340). Kgure 240 (SEQ ID NO:345). Figure 242 (SEQ ID NO:347). Figure 244 (SEQ 
ID NO:349). Figure 246 (SEQ ID NO:351). Figure 248 (SEQ ID NO:353). Figure 250 (SEQ ID NO:355). 
Figure 252 (SEQ ID NO:357). Figure 254 (SEQ ID NO:359). Figure 256 (SEQ ID NO:361). Figure 258 (SEQ 
ID NO:363). Figure 260 (SEQ ID NO:365), Figure 262 (SEQ ID NO:367). Figure 264 (SEQ ID NO:369). 
35 Figure 266 (SEQ ID NO:371 ), Figure 268 (SEQ ID NO:373). Figure 270 (SEQ ID NO:375). Figure 272 (SEQ 
ID NO:377), Figure 274 (SEQ ID NO:379). Figure 276 (SEQ ID NO:38l). Figure 278 (SEQ ID NO:387). 
Figure 280 (SEQ ID NO:389). Figure 282 (SEQ ID NO:394). Figure 284 (SEQ ID NO:399). Figure 286 (SEQ 
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ID NO:40i). Figure 288 (SEQ ID NO:403). Figure 290 (SEQ ID NO:408). Figure 292 (SEQ ID NO:410). 
Figure 294 (SEQ ID NO:412). Figure 296 (SEQ ID NO:4I4). Figure 298 (SEQ ID NO:416). Figure 300 (SEQ 
ID NO:418). Figure 302 (SEQ ID NO:420). Figure 304 (SEQ ID NO:422) and Figure 306 (SEQ ID NO:424). 

2. The nucleic acid sequence of Claim 1 , wherein said nucleotide sequence comprises a nucleotide 
5 sequence selected from the group consisting of the sequence shown in Figure 1 (SEQ ID NO: 1), Figure 3 (SEQ 
ID NO:5). Figure 5 (SEQ ID N0:7), Figure 8 (SEQ ID NO: 13). Figure 1 1 (SEQ ID NO: 19). Figure 14 (SEQ 
ID NO:22). Figure 17 (SEQ ID NO:27). Figure 19 (SEQ ID NO:29). Figure 22 (SEQ ID NO:32). Figure 24 
(SEQ ID NO:35). Figure 26 (SEQ ID NO:40). Figure 29 (SEQ ID NO:46). Figure 31 (SEQ ID N0:51). Figure 
33 (SEQ ID NO:56). Figure 35 (SEQ ID N0:6I), Figure 37 (SEQ ID NO:66). Figure 40 (SEQ ID NO:72). 
10 Figure 46 (SEQ ID NO:83). Figure 48 (SEQ ID NO:94). Figure 50 (SEQ ID NO:96). Figure 52 (SEQ ID 
NO:98). Figure 56 (SEQ ID NO: 102). Figure 63 (SEQ ID NO: 1 12). Figure 65 (SEQ ID NO: 1 14). Figure 67 
(SEQ ID NO: 1 16).. Figure 69 (SEQ ID NO: 118). Figure 71 (SEQ ID NO: 123). Figure 73 (SEQ ID NO: 128). 
Figure 75 (SEQ ID NO: 134). Figure 78 (SEQ ID NO: 137). Figure 82 (SEQ ID NO: 145). Figure 84 (SEQ ID 
NO: 147). Figure 87 (SEQ ID NO:I50). Figure 89 (SEQ ID NO: 152). Figure 92 (SEQ ID NO: 155), Figure 94 

15 (SEQ ID NO: 157). Figure 96 (SEQ ID NO: 159), Figure 98 (SEQ ID NO: 164). Figure 100 (SEQ ID NO: 166). 
Figure 102 (SEQ ID NO: 168). Figure 104 (SEQ ID NO: 170). Figure 108 (SEQ ID NO: 174), Figure 110 (SEQ 
ID NO:176). Figure 112 (SEQ ID NO:178), Figure 114 (SEQ ID NO:180). Figure 116 (SEQ ID NO:182). 
Figure 119(SEQID NO:188). Figure 121 (SEQ ID NO: 193). Figure 124 (SEQ ID NO: 196). Figure 126 (SEQ 
ID NO: 198), Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 132 (SEQ ID NO:204). 

20 Figure 134 (SEQ ID NO:206). Figure 136 (SEQ ID NO:208). Figure 138 (SEQ ID N0:210), Figure 140 (SEQ 
ID NO:212). Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218). Figure 148 (SEQ ID NO:220). 
Figure 150 (SEQ ID NO:222). Figure 152 (SEQ ID NO:224). Figure 154 (SEQ ID NO:226), Figure 156 (SEQ 
ID NO:228), Figure 158 (SEQ ID NO:230). Figure 160 (SEQ ID NO:235), Figure 162 (SEQ ID NO:240), 
Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247). Figure 168 (SEQ ID NO:249), Figure 170 (SEQ 

25 ID NO:252). Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257). Figure 177 (SEQ ID NO:259), 
Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID NO:265), Figure 185 (SEQ 
ID NO:267). Figure 187 (SEQ ID NO:269). Figure 189 (SEQ ID NO:27I). Figure 191 (SEQ ID NO:273), 
Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID NO:280), Figure 199 (SEQ 
ID NO:282). Figure 201 (SEQ ID NO:284). Figure 203 (SEQ ID NO:286). Figure 205 (SEQ ID NO:288). 

30 Figure 207 (SEQ ID NO:290). Figure 209 (SEQ ID NO:292). Figure 211 (SEQ ID NO:294). Figure 213 (SEQ 
ID NO:296). Figure 215 (SEQ ID NO:298). Figure 217 (SEQ ID NO:300). Figure 219 (SEQ ID NO:302). 
Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID NO:318). Figure 232 (SEQ 
ID NO:325). Figure 234 (SEQ ID NO:333). Figure 237 (SEQ ID NO:339). Figure 239 (SEQ ID NO:344). 
Figure 241 (SEQ ID NO:346), Figure 243 (SEQ ID NO:348). Figure 245 (SEQ ID NO:350). Figure 247 (SEQ 

35 ID NO:352). Figure 249 (SEQ ID NO:354). Figure 251 (SEQ ID NO:356). Figure 253 (SEQ ID NO:358). 
Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID NO:364). Figure 261 (SEQ 
ID NO:366). Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370). Figure 267 (SEQ ID NO:372). 
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Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID NO:378), Figure 275 (SEQ 
ID NO:380). Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388). Figure 281 (SEQ ID NO:393), 
Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID NO:402), Figure 289 (SEQ 
ID NO:407). Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID N0:411), Figure 295 (SEQ ID NO:413), 
Figure 297 (SEQ ID NO:415). Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID NO:419), Figure 303 (SEQ 
5 ID NO:421) and Figure 305 (SEQ ID NO:423). 

3. The nucleic acid of Claim 1 , wherein said nucleotide sequence comprises a nucleotide sequence 
selected from the group consisting of the full-length coding sequence of the sequence shown in Figure I (SEQ 
ID NO: 1). Figure 3 (SEQ ID N0:5), Figure 5 (SEQ ID NO:7), Figure 8 (SEQ ID NO: 13). Figure 1 1 (SEQ ID 

10 NO: 19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ 
ID NO:32). Figure 24 (SEQ ID NO:35). Figure 26 (SEQ ID NO:40). Figure 29 (SEQ ID NO:46), Figure 31 
(SEQ ID N0:51), Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID N0:61). Figure 37 (SEQ ID NO:66), Figure 
40 (SEQ ID NO:72). Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94). Figure 50 (SEQ ID N0:%). 
Figure 52 (SEQ ID NO:98). Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NO: 112), Figure 65 (SEQ ID 

15 NO: 1 14), Figure 67 (SEQ ID N0:116), Figure 69 (SEQ ID NO: 1 18), Figure 71 (SEQ ID NO: 123). Figure 73 
(SEQ ID NO: 128). Figure 75 (SEQ ID NO: 134), Figure 78 (SEQ ID NO: 137). Figure 82 (SEQ ID NO: 145), 
Figure 84 (SEQ ID NO: 147). Figure 87 (SEQ ID NO: 150), Figure 89 (SEQ ID NO: 152), Figure 92 (SEQ ID 
NO: 155), Figure 94 (SEQ ID NO: 157), Figure 96 (SEQ ID NO: 159), Figure 98 (SEQ ID NO: 164), Figure 100 
(SEQ ID NO: 166), Figure 102 (SEQ ID NO: 168). Figure 104 (SEQ ID NO: 170). Figure 108 (SEQ ID 

20 NO: 174), Figure 1 10 (SEQ ID NO: 176), Figure 1 12 (SEQ ID NO: 178), Figure 1 14 (SEQ ID NO: 180), Figure 
116 (SEQ ID NO:182), Figure 119 (SEQ ID NO:188), Figure 121 (SEQ ID NO:193). Figure 124 (SEQ ID 
NO: 196). Figure 126 (SEQ ID NO: 198). Figure 128 (SEQ ID NO:200), Figure 130 (SEQ ID NO:202), Figure 
132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206). Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID 
N0:210). Figure 140 (SEQ ID NO:212). Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 

25 148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222). Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID 
NO:226). Figure 156 (SEQ ID NO:228). Figure 158 (SEQ ID NO:230), Figure 160 (SEQ ID NO:235). Figure 
162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID 
NO:249), Figure 170 (SEQ ID NO:252). Figure 173 (SEQ ID NO:255). Figure 175 (SEQ ID NO:257). Figure 
177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:261). Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID 

30 NO:265). Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269). Figure 189 (SEQ ID NO:271). Figure 
191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID 
NO:280), Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 
205 (SEQ ID NO:288). Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID 
NO;294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298). Figure 217 (SEQ ID NO:300), Figure 

35 219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313), Figure 229 (SEQ ID 
NO:318). Figure 232 (SEQ ID NO:325). Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure 
239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346). Figure 243 (SEQ ID NO:348), Figure 245 (SEQ ID 
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NO:350). Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 
253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362)» Figure 259 (SEQ ID 
NO:364)» Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 
267 (SEQ ID NO:372). Figure 269 (SEQ ID NO:374). Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID 
NO:378), Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386). Figure 279 (SEQ ID NO:388), Figure 
5 281 (SEQ ID NO:393). Figure 283 (SEQ ID NO:398). Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID 
NO:402), Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID N0:41 1), Figure 
295 (SEQ ID NO:413), Figure 297 (SEQ ID N0:415). Figure 299 (SEQ ID N0:417), Figure 301 (SEQ ID 
NO:419). Figure 303 (SEQ ID NO:421) or Figure 305 (SEQ ID NO:423). 

10 . 4. Isolated nucleic acid which comprises the ftiU-lengih coding sequence of die DNA deposited 

under any ATCC accession number shown in Table 2. 

5. A vector comprising the nucleic acid of Claim 1 . 

15 6. The vector of Claim 5 operably linked to control sequences recognized by a host cell 

transformed with the vector. 

7. A host cell comprising the vector of Claim 5. 

20 8. The host cell of Claim 7 wherein said cell is a CHO cell. 

9. The host cell of Claim 7 wherein said cell is an £. coli. 

10. The host cell of Claim 7 wherein said cell is a yeast cell. 

25 

11. A process for producing a PRO polypeptides comprising culturing the host cell of Claim 7 
under conditions suitable for expression of said PRO polypeptide and recovering said PRO polypeptide from the 
cell culture. 

30 12. Isolated PRO polypeptide having at least 80% sequence identity to an amino acid sequence 

selected from the group consisting of the amino acid sequence shown in Figure 2 (SEQ ID N0:2), Figure 4 (SEQ 
ID N0:6). Figure 6 (SEQ ID NO:8), Figure 9 (SEQ ID NO: 14), Figure 12 (SEQ ID NO:20), Figure 15 (SEQ 
ID NO:23), Figure 18 (SEQ ID NO:28), Figure 20 (SEQ ID NO:30), Figure 23 (SEQ ID NO:33), Figure 25 
(SEQ ID NO:36), Figure 27 (SEQ ID N0:41), Figure 30 (SEQ ID NO:47), Figure 32 (SEQ ID NO:52). Figure 

35 34 (SEQ ID NO:57). Figure 36 (SEQ ID NO:62). Figure 38 (SEQ ID NO:67). Figure 41 (SEQ ID NO:73). 
Figure 47 (SEQ ID NO:84), Figure 49 (SEQ ID NO;95), Figure 51 (SEQ ID NO:97), Figure 53 (SEQ ID 
NO:99), Figure 57 (SEQ ID NO: 103), Figure 64 (SEQ ID NO: 1 13). Figure 66 (SEQ ID NO: 1 15), Figure 68 
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(SEQ ID NO: 117), Figure 70 (SEQ ID NO: 119), Figure 72 (SEQ ID NO: 124), Figure 74 (SEQ ID NO: 129). 
Figure 76 (SEQ ID NO: 135), Figure 79 (SEQ ID NO: 138), Figure 83 (SEQ ID NO: 146), Figure 85 (SEQ ID 
NO: 148), Figure 88 (SEQ ID NO: 15 1), Figure 90 (SEQ ID NO: 153). Figure 93 (SEQ ID NO: 156), Figure 95 
(SEQ ID NO: 158), Figure 97 (SEQ4D NO: 160). Figure 99 (SEQ ID NO: 165), Figure 101 (SEQ ID NO:i67), 
Figure 103 (SEQ ID NO: 169). Figure 105 (SEQ ID NO: 171), Figure 109 (SEQ ID NO: 175), Figure 1 1 1 (SEQ 
5 ID NO:177). Figure 113 (SEQ ID N0:179), Figure 115 (SEQ ID N0:181), Figure 117 (SEQ ID NO:183), 
Figure 120 (SEQ ID NO: 189), Figure 122 (SEQ ID NO: 194), Figure 125 (SEQ ID NO: 197), Figure 127 (SEQ 
ID NO:199). Figure 129 (SEQ ID NO:201), Figure 131 (SEQ ID NO:203), Figure 133 (SEQ ID NO:205). 
Figure 135 (SEQ ID NO:207). Figure 137 (SEQ ID NO:209), Figure 139 (SEQ ID N0:21 1), Figure 141 (SEQ 
ID NO:213), Figure 144 (SEQ ID NO:216), Figure 147 (SEQ ID NO:219), Figure 149 (SEQ ID NO:221). 

10 Figure 151 (SEQ ID NO:223), Figure 153 (SEQ ID NO:225), Figure 155 (SEQ ID NO:227), Figure 157 (SEQ 
ID NO:229). Figure 159 (SEQ ID NO:231), Figure 161 (SEQ ID NO:236), Figure 163 (SEQ ID NO:241), 
Figure 165 (SEQ ID NO:246), Figure 167 (SEQ ID NO:248), Figure 169 (SEQ ID NO:250). Figure 171 (SEQ 
ID NO:253), Figure 174 (SEQ ID NO:256). Figure 176 (SEQ ID NO:258), Figure 178 (SEQ ID NO:260), 
Figure 180 (SEQ ID NO:262), Figure 182 (SEQ ID NO:264). Figure 184 (SEQ ID NO:266), Figure 186 (SEQ 

15 ID NO:268), Figure 188 (SEQ ID NO:270), Figure 190 (SEQ ID NO:272), Figure 192 (SEQ ID NO:274), 
Figure 194 (SEQ ID NO:276), Figure 196 (SEQ ID NO:278). Figure 198 (SEQ ID NO:28i), Figure 200 (SEQ 
ID NO:283). Figure 202 (SEQ ID NO:285), Figure 204 (SEQ ID NO:287), Figure 206 (SEQ ID NO:289), 
Figure 208 (SEQ ID NO:291), Figure 210 (SEQ ID NO:293), Figure 212 (SEQ ID NO:295), Figure 214 (SEQ 
ID NO:297), Figure 216 (SEQ ID NO:299), Figure 218 (SEQ ID NO:301), Figure 220 (SEQ ID NO:303), 

20 Figure 226 (SEQ ID NO:309), Figure 228 (SEQ ID NO:314), Figure 230 (SEQ ID NO:319), Figure 233 (SEQ 
ID NO:326), Figure 235 (SEQ ID NO:334), Figure 238 (SEQ ID NO:340), Figure 240 (SEQ ID NO:345). 
Figure 242 (SEQ ID NO:347), Figure 244 (SEQ ID NO:349), Figure 246 (SEQ ID NO:351). Figure 248 (SEQ 
ID NO:353), Figure 250 (SEQ ID NO:355), Figure 252 (SEQ ID NO:357), Figure 254 (SEQ ID NO:359), 
Figure 256 (SEQ ID NO:361), Figure 258 (SEQ ID NO:363), Figure 260 (SEQ ID NO:365), Figure 262 (SEQ 

25 ID NO:367), Figure 264 (SEQ ID NO:369), Figure 266 (SEQ ID NO:371), Figure 268 (SEQ ID NO:373), 
Figure 270 (SEQ ID NO:375). Figure 272 (SEQ ID NO:377), Figure 274 (SEQ ID NO:379), Figure 276 (SEQ 
ID NO:381), Figure 278 (SEQ ID NO:387). Figure 280 (SEQ ID NO:389), Figure 282 (SEQ ID NO:394). 
Figure 284 (SEQ ID NO:399), Figure 286 (SEQ ID NO:401), Figure 288 (SEQ ID NO:403). Figure 290 (SEQ 
ID NO:408), Figure 292 (SEQ ID NO:410). Figure 294 (SEQ ID NO:412), Figure 296 (SEQ ID NO:414), 

30 Figure 298 (SEQ ID NO:416). Figure 300 (SEQ ID NO:418), Figure 302 (SEQ ID NO:420), Figure 304 (SEQ 
ID NO:422) and Figure 306 (SEQ ID NO:424). 

13. Isolated PRO polypeptide having at least 80% sequence identity to the amino acid sequence 
encoded by a nucleic acid molecule deposited under.any ATCC accession number shown in Table 2. 

35 

14. A chimeric molecule comprising a polypeptide according to Claim 12 fused to a heterologous 
amino acid sequence. 
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15. The chimeric molecule of Claim 14 wherein said heterologous amino acid sequence is an 
epitope tag sequence. 

16. The chimeric molecule of Claim 14 wherein said heterologous amino acid sequence is a Fc 
region of an inmiunoglobulin. 

5 

17. An antibody which specifically binds to a PRO polypeptide according to Claim 12. 

18. The antibody of Claim 17 wherein said antibody is a monoclonal antibody. 
10 19. The antibody of Claim 17 wherein said antibody is a humanized antibody. 

20. The antibody of Claim 17 wherein said antibody is an antibody fragment. 

21 . An isolated nucleic acid molecule which has at least 80% sequence identity to a nucleic acid 
IS which comprises a nucleotide sequence selected from the group consisting of diat shown m Figure 1 (SEQ ID 

N0:1), Figure 3 (SEQ ID NO:5). Figure 5 (SEQ ID N0:7). Figure 8 (SEQ ID NO: 13), Figure 1 1 (SEQ ID 
NO: 19), Figure 14 (SEQ ID NO:22), Figure 17 (SEQ ID NO:27), Figure 19 (SEQ ID NO:29), Figure 22 (SEQ 
ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID N0:40), Figure 29 (SEQ ID NO:46). Figure 31 
(SEQ ID NO:51), Figure 33 (SEQ ID NO:56)» Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 

20 40 (SEQ ID NO:72). Figure 46 (SEQ ID NO:83). Figure 48 (SEQ ID NO:94), Figure 50 (SEQ ID NO:96), 
Figure 52 (SEQ ID NO:98). Figure 56 (SEQ ID NO: 102), Figure 63 (SEQ ID NO: 112). Figure 65 (SEQ ID 
NO: 1 14). Figure 67 (SEQ ID NO: 1 16). Figure 69 (SEQ ID NO: 1 18), Figure 71 (SEQ ID NO: 123). Figure 73 
(SEQ ID NO: 128), Figure 75 (SEQ ID NO: 134), Figure 78 (SEQ ID NO: 137). Figure 82 (SEQ ID NO: 145). 
Figure 84 (SEQ ID NO: 147). Figure 87 (SEQ ID NO: 150). Figure 89 (SEQ ID NO: 152). Figure 92 (SEQ ID 

25 NO: 155). Figure 94 (SEQ ID NO: 157). Figure 96 (SEQ ID NO: 159). Figure 98 (SEQ ID NO: 164). Figure 100 
(SEQ ID NO: 166), Figure 102 (SEQ ID NO: 168), Figure 104 (SEQ ID NO: 170), Figure 108 (SEQ ID 
NO: 174), Figure 1 10 (SEQ ID NO: 176), Figure 1 12 (SEQ ID NO: 178), Figure 114 (SEQ ID NO: 180). Figure 
1 16 (SEQ ID NO: 182), Figure 119 (SEQ ID NO: 188). Figure 121 (SEQ ID NO: 193), Figure 124 (SEQ ID 
NO: 196). Figure 126 (SEQ ID NO: 198), Figure 128 (SEQ ID NO:200). Figure 130 (SEQ ID NO:202). Figure 

30 132 (SEQ ID NO:204). Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208), Figure 138 (SEQ ID 
NO:210), Figure 140 (SEQ ID NO:212), Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 
148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID 
NO:226), Figure 156 (SEQ ID NO:228). Figure 158 (SEQ ID NO:230). Figure 160 (SEQ ID NO:235). Figure 
162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247). Figure 168 (SEQ ID 

35 NO:249), Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255). Figure 175 (SEQ ID NO:257). Figure 
177 (SEQ ID NO:2S9), Figure 179 (SEQ ID NO:261), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID 
NO:265), Figure 185 (SEQ ID NO:267), Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271), Figure 
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191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275). Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID 
NO:280). Figure 199 (SEQ ID NO:282). Figure 201 (SEQ ID NO:284). Figure 203 (SEQ ID NO:286), Figure 
205 (SEQ ID NO:288). Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292). Figure 211 (SEQ ID 
NO:294), Figure 213 (SEQ ID NO:296), Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 
219 (SEQ ID NO:302), Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313). Figure 229 (SEQ ID 
5 NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333). Figure 237 (SEQ ID NO:339), Figure 
239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346). Figure 243 (SEQ ID NO:348). Figure 245 (SEQ ID 
NO:350), Figure 247 (SEQ ID NO:352). Figure 249 (SEQ ID NO:354), Figure 251 (SEQ ID NO:356), Figure 
253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID 
NO:364), Figure 261 (SEQ ID NO:366). Figure 263 (SEQ ID Nb:368), Figure 265 (SEQ ID NO:370). Figure 

10 267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374). Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID 
NO:378), Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388). Figure 
281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID 
NO:402). Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409). Figure 293 (SEQ ID N0:41 1). Figure 
295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415). Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID 

15 NO:419), Figure 303 (SEQ ID NO:421) and Figure 305 (SEQ ID NO:423). 

22. An isolated nucleic acid molecule which has at least 80% sequence identity to the full-length 
coding sequence of a nucleotide sequence selected from the group consisting of that shown in Figure 1 (SEQ ID 
N0:1), Figure 3 (SEQ ID N0:5), Figure 5 (SEQ ID N0:7), Figure 8 (SEQ ID NO: 13), Figure 11 (SEQ ID 

20 NO: 19), Figure 14 (SEQ ID NO:22). Figure 17 (SEQ ID NO:27). Figure 19 (SEQ ID NO:29). Figure 22 (SEQ 
ID NO:32), Figure 24 (SEQ ID NO:35), Figure 26 (SEQ ID NO:40), Figure 29 (SEQ ID NO:46), Figure 31 
(SEQ ID N0:51). Figure 33 (SEQ ID NO:56), Figure 35 (SEQ ID NO:61), Figure 37 (SEQ ID NO:66), Figure 
40 (SEQ ID NO:72), Figure 46 (SEQ ID NO:83), Figure 48 (SEQ ID NO:94). Figure 50 (SEQ ID NO:96), 
Figure 52 (SEQ ID NO:98). Figure 56 (SEQ ID NO: 102). Figure 63 (SEQ ID NO: 112), Figure 65 (SEQ ID 

25 N0:114), Figure 67 (SEQ ID N0:116), Figure 69 (SEQ ID N0:1 18). Figure 71 (SEQ ID NO:123), Figure 73 
(SEQ ID NO: 128), Figure 75 (SEQ ID NO: 134), Figure 78 (SEQ ID NO: 137), Figure 82 (SEQ ID NO: 145), 
Figure 84 (SEQ ID NO: 147), Figure 87 (SEQ ID NO: 150), Figure 89 (SEQ ID NO: 152), Figure 92 (SEQ ID 
NO: 155), Figure 94 (SEQ ID NO: 157), Figure 96 (SEQ ID NO: 159). Figure 98 (SEQ ID NO: 164), Figure 100 
(SEQ ID NO: 166), Figure 102 (SEQ ID NO: 168), Figure 104 (SEQ ID NO: 170), Figure 108 (SEQ ID 

30 NO: 174), Figure 1 10 (SEQ ID NO: 176), Figure 1 12 (SEQ ID NO: 178), Figure 1 14 (SEQ ID NO: 180), Figure 
116 (SEQ ID NO:182), Figure 119 (SEQ ID NO:188). Figure 121 (SEQ ID NO:I93), Figure 124 (SEQ ID 
NO: 196). Figure 126 (SEQ ID NO: 198), Figure 128 (SEQ ID NO:200). Figure 130 (SEQ ID NO:202), Figure 
132 (SEQ ID NO:204), Figure 134 (SEQ ID NO:206), Figure 136 (SEQ ID NO:208). Figure 138 (SEQ ID 
NO:210). Figure 140 (SEQ ID NO:212). Figure 143 (SEQ ID NO:215), Figure 146 (SEQ ID NO:218), Figure 

35 148 (SEQ ID NO:220), Figure 150 (SEQ ID NO:222), Figure 152 (SEQ ID NO:224), Figure 154 (SEQ ID 
NO:226), Figure 156 (SEQ ID NO:228). Figure 158 (SEQ ID NO:230). Figure 160 (SEQ ID NO:235). Figure 
162 (SEQ ID NO:240), Figure 164 (SEQ ID NO:245), Figure 166 (SEQ ID NO:247), Figure 168 (SEQ ID 

507 



wo 99/63088 



PCT/US99/12252 



NO:249). Figure 170 (SEQ ID NO:252), Figure 173 (SEQ ID NO:255), Figure 175 (SEQ ID NO:257). Figure 
177 (SEQ ID NO:259), Figure 179 (SEQ ID NO:26l), Figure 181 (SEQ ID NO:263), Figure 183 (SEQ ID 
NO:265), Figure 185 (SEQ ID NO:267). Figure 187 (SEQ ID NO:269), Figure 189 (SEQ ID NO:271). Figure 
191 (SEQ ID NO:273), Figure 193 (SEQ ID NO:275), Figure 195 (SEQ ID NO:277), Figure 197 (SEQ ID 
NO:280), Figure 199 (SEQ ID NO:282), Figure 201 (SEQ ID NO:284), Figure 203 (SEQ ID NO:286), Figure 
5 205 (SEQ ID NO:288). Figure 207 (SEQ ID NO:290), Figure 209 (SEQ ID NO:292), Figure 211 (SEQ ID 
NO:294), Figure 213 (SEQ ID NO:296). Figure 215 (SEQ ID NO:298), Figure 217 (SEQ ID NO:300), Figure 
219 (SEQ ID NO:302). Figure 225 (SEQ ID NO:308), Figure 227 (SEQ ID NO:313). Figure 229 (SEQ ID 
NO:318), Figure 232 (SEQ ID NO:325), Figure 234 (SEQ ID NO:333), Figure 237 (SEQ ID NO:339), Figure 
239 (SEQ ID NO:344), Figure 241 (SEQ ID NO:346). Figure 243 (SEQ ID NO:348). Figure 245 (SEQ ID 

10 NO:350). Figure 247 (SEQ ID NO:352), Figure 249 (SEQ ID NO:354). Figure 251 (SEQ ID NO:356), Figure 
253 (SEQ ID NO:358), Figure 255 (SEQ ID NO:360), Figure 257 (SEQ ID NO:362), Figure 259 (SEQ ID 
NO:364), Figure 261 (SEQ ID NO:366), Figure 263 (SEQ ID NO:368), Figure 265 (SEQ ID NO:370), Figure 
267 (SEQ ID NO:372), Figure 269 (SEQ ID NO:374), Figure 271 (SEQ ID NO:376), Figure 273 (SEQ ID 
NO:378), Figure 275 (SEQ ID NO:380), Figure 277 (SEQ ID NO:386), Figure 279 (SEQ ID NO:388), Figure 

15 281 (SEQ ID NO:393), Figure 283 (SEQ ID NO:398), Figure 285 (SEQ ID NO:400), Figure 287 (SEQ ID 
NO:402). Figure 289 (SEQ ID NO:407), Figure 291 (SEQ ID NO:409), Figure 293 (SEQ ID N0:411). Figure 
295 (SEQ ID NO:413), Figure 297 (SEQ ID NO:415), Figure 299 (SEQ ID NO:417), Figure 301 (SEQ ID 
NO:419), Figure 303 (SEQ ID NO:421) and Figure 305 (SEQ ID NO:423). 

20 23. An isolated extracellular domain of of PRO polypeptide. 

24. An isolated PRO polypeptide lacking its associated signal peptide. 

25. An isolated polypeptide having at least about 80% amino acid sequence identity to an 
25 extracellular domain of of PRO polypeptide. 

26. An isolated polypeptide having at least about 80% amino acid sequence identity to a PRO 
polypeptide lacking its associated signal peptide. 
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FIGURE 1 

CGGACGCGTGGGTGCGAGGCGAAGGTGACCGGGGACCGAGCATTTCAGATCTGCTCGGTAGA 

CCTGGTGCACCACCAC CATG TTGGCTGCAAGGCTGGTGTGTCTCCGGACACTACCTTCTAGG 

GTTTTCCACCCAGCTTTCACCAAGGCCTCCCCTGTTGTGAAGAATTCCATCACGAAG^ 

ATGGCTGTTAACACCTAGCAGGGAATATGCCACCAAAACAAGAATTGGGATCCGGCGTGGGA 

GAACTGGCCAAGAACTCa^GAGGCAGCATTGGAACCATCGATGGAAAAA^ 

GATCAGATGGGAAGATGGTTTGTTGCTGGAGGGGCTGCTGTTGGTCTTGGAGCATTGTGCTA 

CTATGGCTTGGGACTGTCTAATGAGATTGGAGCTATTGAAAAGGCTGTAATTTGGCCTCAGT 

ATGTCAAGGATAGAATTCATTCCACCTATATGTACTTAGCAGGGAGTATTGGTTTAACAGCT 

TTGTCTGCCATAGCAATCAGCAGAACGCCTGTTCTCATGAACTTCATGATGAGAGGCTCTTG 

GGTGACAATTGGTGTGACCTTTGCAGCCATGGTTGGAGCTGGAATGCTGGTACGATCAATAC 

CATATGACCAGAGCCCAGGCCCAAAGCATCTTGCTTGGTTGCTACATTCTGGTGTGATGGGT 

GCAGTGGTGGCTCCTCTGACAATATTAGGGGGTCCTCTTCTCATCAGAGCTGCATGGTACAC 

AGCTGGCATTGTGGGAGGCCTCTCCACTGTGGCCATGTGTGCGCCCAGTGAAAAGTTTCTGA 

ACATGGGTGCACCCCTGGGAGTGGGCCTGGGTCTCGTCTTTGTGTCCTCATTGGGATCTATG 

TTTCTTCCACCTACCACCGTGGCTGGTGCCACTCTTTACTCAGTGGCAATGTACGGTGGATT 

AGTTCTTTTCAGCATGTTCCTTCTGTATGATACCCAGA?^GTAATCAAGCGTGCAGAAGTAT 

CACCAATGTATGGAGTTCAAAAATATGATCCCATTAACTCGATGCTGAGTATCTACATGGAT 

ACATTAAATATATTTATGCGAGTTGCAACTATGCTGGCAACTGGAGGCAACAGAAAGAAATG 

AAGTGACTCAGCTTCTGGCTTCTCTGCTACATCAAATATCTTGTTTAATGGGGCAGATATGC 

ATTAAATAGTTTGTACAAGCAGCTTTCGTTGAAGTTTAGAAGATAAGAAACATGTCATCATA 

TTTA/^TGTTCCGGTAATGTGATGCCTCAGGTCTGCCTTTTTTTCTGGAGAATAAATGCAGT 

AATCCTCTCCCAAATAAGCACACACATTTTCAATTCTCATGTTTGAGTGATTTTAAAATGTT 

TTGGTGAATGTGAAAACTAAAGTTTGTGTCATGAGAATGTAAGTCTTTTTTCTACTTTAAAA 

TTTAGTAGGTTCACTGAGTAACTAAAATTTAGCAAACCTGTGTTTGCATATTTTTTTGGAGT 

GCAGAATATTGTAATTAATGTCATAAGTGATTTGGAGCTTTGGTAAAGGGACCAGAGAGAAG 

GAGTCACCTGCAGTCTTTTGTTTTTTTAAATACTTAGAACTTAGCACTTGTGTTATTGATTA 

GTGAGGAGCCAGTAAGAAACATCTGGGTATTTGGAAACAAGTGGTCATTGTTACATTCATTT 

GCTGAACITAACAAAACTGTTCATCCrrGAAACAGGCACAGGTGATGa^TTCT^ 

CTTCTCAGTGCTCTCTTTCCAATATAGATGTGGTCATGTTTGACTTGTACAGAATGTTAATC 

ATACAGAGAATCCTTGATGGAATTATATATGTGTGTTTTACTTTTGAATGTTACAAAAGGAA 

ATAACTTTAAAACTATTCTCAAGAGAAAATATTCAAAGCATGAAATATGTTGCTTTTTCCAG 

AATACAAACAGTATACTCATG 
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FIGURE 2 

MLAARLVCLRTLPSRVFHPAFTKASPWKNSITKNQWLLTPSREYATKTRIGIRRGRTGQEL 
KEAALEPSMEKIFKIDQMGRWFVAGGAAVGLGALCYYGLGLSNEIGAIEKAVIWPQYVKDRI 
HSTYMYLAGSIGLTALSAIAISRTPVLMNFMMRGSWVTIGVTFAAMVGAGMLVRSIPYDQSP 
GPKHIAWU^SGVMGAWAPLTILGGPLLIRAAWYTAGIVGGLSTVAMCAPSEKFLNMGAPL 
GVGLGLVFVSSLGSMFLPPTTVAGATLYSVAMYGGLVLFSMFLLYDTQKVIKRAEVSPMYGV 
QKYDPINSMLSIYMDTLNIFMRVATMLATGGNRKK 
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FIGURE 3 

GAAGGCTGCCTCGCTGGTCCGAATTCGGTGGCGCCACGTCCGCCCGTCTCCGCCTTCTGCAT 
CGCGGCTTCGGCGGCTTCCACCTAGACACCTAACAGTCGCGGAGCCGGCCGCGTCGTGAGGG 
GGTCGGCACGGGGAGTCGGGCGGTCTTGTGCATCTTGGCTACCTGTGGGTCGAAGATGTCGG 
ACATCGGAGACTGGTTCAGGAGCATCCCGGCGATCACGCGCTATTGGTTCGCCGCCACCGTC 
GCCGTGCCCTTGGTCGGCAAACTCGGCCTCATGAGCCCGGCCTACCTCTTCCTCTGGCCCGA 
AGCCTTCCTTTATCGCTTTCAGATTTGGAGGCCAATCACTGCCACCTTTTATTTCCCTGTGG 
GTCCAGGAACTGGATTTCTTTATTTGGTCAATTTATATTTCTTATATCAGTATTCTACGCGA 
CTTGAAACAGGAGCTTTTGATGGGAGGCCAGCAGACTATTTATTCATGCTCCTCTTTAACTG 
GATTTGCATCGTGATTACTGGCTTAGCAATGGATATGCAGTTGCTGATGATTCCTCTGATCA 
TGTCAGTACTTTATGTCTGGGCCa^GCTGAACAGAGACATGATTGTATCATTTTGGTTTGGA 
ACACGATTTAAGGCCTGCTATTTACCCTGGGTTATCCTTGGATTCAACTATATCATCGGAGG 
CTCGGTAATCAATGAGCTTATTGGAAATCTGGTTGGACATCTTTATTTTTTCCTAATGTTCA 
GATACCCAATGGACTTGGGAGGAAGAAATTTTCTATCCACACCTCAGTTTTTGTACCGCTGG 
CTGCCCAGTAGGAGAGGAGGAGTATCAGGATTTGGTGTGCCCCCTGCTAGCATGAGGCGAGC 
TGCTGATCAGAATGGCGGAGGCGGGAGACACAACTGGGGCCAGGGCTTTCGACTTGGAGACC 
AGTGAAGGGGCGGCCTCGGGCAGCCGCTCCTCTCAAGCCACATTTCCTCCCAGTGCTGGGTG 
CACTTAACAACTGCGTTCTGGCTAACACTGTTGGACCTGACCCACACTGAATGTAGTCTTTC 
AGTACGAGACAAAGTTTCTTAAATCCCGAAGAAAAATATAAGTGTTCCACAAGTTTCACGAT 
TCTCATTCAAGTCCTTACTGCTGTGAAGAACAAATACCAACTGTGCAAATTGCAAAACTGAC 
TACATTTTTTGGTGTCTTCTCTTCTCCCCTTTCCGTCTGAATAATGGGTTTTAGCGGGTCCT 
AATCTGCTGGCATTGAGCTGGGGCTGGGTCACCAAACCCTTCCCAAAAGGACCTTATCTCTT 
TCTTGCACACATGCCTCTCTCCCACTTTTCCCAACCCCCACATTTGCAACTAGAAAAAGTTG 
CCCATAAAATTGCTCTGCCCTTGACAGGTTCTGTTATTTATTGACTTTTGCCAAGGCTGGTC 
ACAACAATCATATTCACGTTATTTTCCCCTTTTGGTGGCAGAACTGTTACCAATAGGGGGAG 
AAGACAGCCACGGATGAAGCGTTTCTCAGCTTTTGGAATTGCTTCGACTGACATCCGTTGTT 
AACCGTTTGCCACTCTTCAGATATTTTTTATAAAAAAAGTACCACTGAGTTCATGAGGGCCA 
CAGATTGGTTATTAATGAGATACGAGGGTTGGTGCTGGGTGTTTGTTTCCTGAGCTAAGTGA 
TCAAGACTGTAGTGGAGTTGCAGCTAACATGGGTTAGGTTTAAACCATGGGGGATGCACCCC 
TTTGCGTTTCATATGTAGCCCTACTGGCTTTGTGTAGCTGGAGTAGTTGGGTTGCTTTGTGT 
TAGGAGGATCCAGATCATGTTGGCTACAGGGAGATGCTCTCTTTGAGAGGTCCTGGGCATTG 
ATTCCCATTTCAATCTCATTCTGGATATGTGTTCATTGAGTAAAGGAGGAGAGACCCTCATA 
CGCTATTTAAATGTCACTTTTTTGCCTATCCCCCGTTTTTTGGTCATGTTTCAATTAATTGT 
GAGGAAGGCGCAGCTCCTCTCTGCACGTAGATCATTTTTTAAAGCTAATGTAAGCACATCTA 
AGGGAATAACATGATTTAAGGTTGAAATGGCTTTAGAATCATTTGGGTTTGAGGGTGTGTTA 
TTTTGAGTCATGAATGTACAAGCTCTGTGAATCAGACCAGCTTAAATACCCACACCTTTTTT 
TCGTAGGTGGGCTTTTCCTATCAGAGCTTGGCTCATAACCAAATAAAGTTTTTTGAAGGCCA 
TGGCTTTTCACACAGTTATTTTATTTTATGACGTTATCTGAAAGCAGACTGTTAGGAGCAGT 
ATTGAGTGGCTGTCACACTTTGAGGCAACTAAAAAGGCTTCAAACGTTTTGATCAGTTTCTT 
TTCAGGAAACATTGTGCTCTAACAGTATGACTATTCTTTCCCCCACTCTTAAACAGTGTGAT 
GTGTGTTATCCTAGGAAATGAGAGTTGGCAAACAACTTCTCATTTTGAATAGAGTTTGTGTG 
TACTTCTCCATATTTAATTTATATGATAAAATAGGTGGGGAGAGTCTGAACCTTAACTGTCA 
TGTTTTGTTGTTCATCTGTGGCCACAATAAAGTTTACTTGTAAAATTTTAGAGGCCATTACT 
CCAATTATGTTGCACGTACACTCATTGTACAGGCGTGGAGACTCATTGTATGTATAAGAATA 
TTTCTGACAGTGAGTGACCCGGAGTCTCTGGTGTACCCTCTTACCAGTCAGCTGCCTGCGAG 
CAGTCATTTTTTCCTAAAGGTTTACAAGTATTTAGAACTTTTCAGTTCAGGGCAAAATGTTC 
ATGAAGTTATTCCTCTTAAACATGGTTAGGAAGCTGATGACGTTATTGATTTTGTCTGGATT 
ATGTTTCTGGAATAATTTTACCAAAACAAGCTATTTGAGTTTTGACTTGACAAGGCAAAACA 
TGACAGTGGATTCTCTTTACAAATGGAAAAAAAAAATCCTTATTTTGTATAAAGGACTTCCC 
TTTTTGTAAACTAATCCTTTTTATTGGTAAAAATTGTAAATTAAAATGTGCAACTTG 
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FIGURE 4 

MSDIGDWFRS I PAITRYWFAATVAVPLVGKLGLI SPAYLFLWPEAFLYRFQI WRPITATF YF 

pvgpgtgflylvl^yflyqystrletgafdgrpadylfmllfnwicivitglamdmqlu^ip 

limsvlyvwaqijtrdmivsfwfgtrfkacylpwvilgfnyiiggsvinelignlvgh^ 

mfrypmdi/x;rnflstpqflyrwlpsrrggvsgfg\^pasmrraadqnggg^ 
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GGGGCCGCGGTCTAGGGCGGCTACGTGTGTTGCCATAGCGACCATTTTGCATTAACTGGTTG 
GTAGCTTCTATCCTGGGGGCTGAGCGACTGCGGGCCAGCTCTTCCCCTACTCCCTCTCGGCT 
CCTTGTGGCCCAAAGGCCTAACCGGGGTCCGGCGGTCTGGCCTAGGGATCTTCCCCGTTGCC 
CCTTTGGGGCGG GATG GCTGCGGAAGAAGAAGACGAGGTGGAGTGGGTAGTGGAGAGCATCG 
CGGGGTTCCTGCGAGGCCCAGACTGGTCCATCCCCATCTTGGACTTTGTGGAACAGAAATGT 
GAAGTTAACTGCAAAGGAGGGCATGTGATAACTCCAGGAAGCCCAGAGCCGGTGATTTTGGT 
GGCCTGTGTTCCCCTTGTTTTTGATGATGAAGAAGAAAGCAAATTGACCTATACAGAGATTC 
ATCAGGAATACAAAGAACTAGTTGAAAAGCTGTTAGAAGGTTACCTCAAAGAAATTGGAATT 
AATGAAGATCAATTTCAAGAAGCATGCACTTCTCCTCTTGCAAAGACCCATACATCACAGGC 
CATTTTGCAACCTGTGTTGGCAGCAGAAGATTTTACTATCTTTAAAGCAATGATGGTCCAGA 
AAAACATTGAAATGCAGCTGCAAGCCATTCGAATAATTCAAGAGAGAAATGGTGTATTACCT 
GACTGCTTAACCGATGGCTCTGATGTGGTCAGTGACCTTGAACACGAAGAGATGAAAATCCT 
GAGGGAAGTTCTTAGAAAATCAAAAGAGGAATATGACCAGGAAGAAGAAAGGAAGAGGAAAA 
AACAGTTATCAGAGGCTAAAACAGAAGAGCCCACAGTGCATTCCAGTGAAGCTGCAATAATG 
AATAATTCCCAAGGGGATGGTGAACATTTTGCACACCCACCCTCAGAAGTTAAAATGCATTT 
TGCTAATCAGTCAATAGAACCTTTGGGAAGAAAAGTGGAAAGGTCTGAAACTTCCTCCCTCC 
CACAAAAAGGCCTGAAGATTCCTGGCTTAGAGCATGCGAGCATTGAAGGACCAATAGCAAAC 
TTATCAGTACTTGGAACAGAAGAACTTCGGCAACGAGAACACTATCTCAAGCAGAAGAGAGA 
TAAGTTGATGTCCATGAGAAAGGATATGAGGACTAAACAGATACAAAATATGGAGCAGAAAG 
GAAAACCCACTGGGGAGGTAGAGGAAATGACAGAGAAACCAGAAATGACAGCAGAGGAGAAG 
CAAACATTACTAAAGAGGAGATTGCTTGCAGAGAAACTCAAAGAAGAAGTTATTAATAAGTA 
ATAATTAAGAACAATTTAACAAAATGGAAGTTCAAATTGTCTTAAAAATAAATTATTTAGTC 
CTTACACTG 
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FIGURE 6 

MAAEEEDEVEWVVESIAGFLRGPDWSIPILDFVEQKCEVNCKGGHVITPGSPEPVILVACVP 

LVFDDEEESKLTYTEIHQEYKELVEKLLEGYLKEIGINEDQFQEACTSPLAKTHTSQAILQP 

VLAAEDFTIFKAMMVQKNIEMQLQAIRIIQERNGVLPDCliTDGSDW 

RKSKEEYDQEEERKRKKQLSEAKTEEPTVHSSEAAIMNNSQGDGEHFAHPPSEVKMHF^ 

lEPLGRKVERSETSSLPQKGLKIPGLEHASIEGPIANLSVLGTEELRQREHYLKQKRDKU^ 

MRKDMRTKQIQNI^QKGKPTGEVEEMTEKPEMTAEEKQTIJjKRRLL^ 
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FIGURE? 

GGGCACAGCACATGTGAAGTTTTTGATGATGAAGAAGAAAGCAAATTGACCTATACAGAGAT 

TCATCAGGAATACAAAGAACTAGTTGAAAAGCTGTTAGAAGGTTACCTCAAAGAAATTG^ 

TTAATGAAGATCAATTTCAAGAAGCATGCACTTCTCCTCTTGCAAAGACCCATACATC^ 

GCCATTTTTGCAACCTGTGTTGGCAGCAGAAGATTTTACTATCTTTAAAGC^ 

AGAAAAACATTGAAATGCAGCTGCAAGCCATTCGAATAATTCAAGAGAGAAATGGTGTATTA 

CCTGACTGCTTAACCGATGGCTCTGATGTGGTCAGTGACCTTGAACACGAAGAGATGAAAAT 

CCTGAGGGAAGTTCTTAGAAAATCAAAAGAGGAATATGACCAGGAA 
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FIGURES 

GCGTGGTTTTTGTTCTGCAATAGGCGGCTTAGAGGGAGGGGCTTTTTCGCCTATACCTACT 

TAGCTTCTCCACGTATGGACCCTAAAGGCTACTGCTGCTACTACGGGGCTAGACAGTTACTG 

TCTCAGCTCTAGGATGTGCGTTCTTCCACTAGAAGCTCTTCTGAGGGAGGTAATTAAAAAAC 

AGTGGAATGGAAAAACAGTGCTGTAGTCATCCTGTAATATGCTCCTTGTCAACAATGTATAC 

ATTCCTGCTAGGTGCCATATTCATTGCTTTAAGCTCAAGTCGCATCTTACTAGTGAAGTATT 

CTGCCAATGAAGAAAACAAGTATGATTATCTTCCAACTACTGTGAATGTGTGCTCAGAACTG 

GTGAAGCTAGTTTTCTGTGTGCTTGTGTCATTCTGTGTTATAAAGAAAGATCATCAAAGTAG 

AAATTTGAAATATGCTTCCTGGAAGGAATTCrCTGATTTCATGAAGTGGTCaVTTCCTGCCT 

TTCTTTATTTCCTGGATAACTTGATTGTCTTCTATGTCCTGTCCTATCTTCAACCAGCCATG 

GCTGTTATCTTCTCAAATTTTAGCATTATAACAACAGCTCTTCTATTCAGGATAGTGCTGAA 

GAGGCGTCTAAACTGGATCCAGTGGGCTTCCCTCCTGACTTTATTTTTGTCTATTGTGGCCT 

TGACTGCCGGGACTAAAACTTTACAGCACAACTTGGCAGGACGTGGATTTCATCACGATGCC 

TTTTTCAGCCCTTCCAATTCCTGCCTTCTTTTCAGAAGTGAGTGTCCCAGAAAAGACAATTG 

TACAGCAAAGGAATGGACTTTTCCTGAAGCTAAATGGAACACCACAGCCAGAGTTTTCAGTC 

ACATCCGTCTTGGCATGGGCCATGTTCTTATTATAGTCCAGTGTTTTATTTCTTCAATGGCT 

AATATCTATAATGAAAAGATACTGAAGGAGGGGAACCAGCTCACTGAAAGCATCTTCATACA 

GAACAGCAAACTCTATTTCTTTGGCATTCTGTTTAATGGGCTGACTCTGGGCCTTCAGAGGA 

GTAACCGTGATCAGATTAAGAACTGTGGATTTTTTTATGGCCACAGTGCATTTTCAGTAGCC 

CTTATTTTTGTAACTGCATTCCAGGGCCTTTCAGTGGCTTTCATTCTGAAGTTCCTGGATAA 

CATGTTCCATGTCTTGATGGCCCAGGTTACCACTGTCATTATCACAACAGTGTCTGTCCTGG 

TCTTTGACTTCAGGCCCTCCCTGGAATTTTTCTTGGAAGCCCCATCAGTCCTTCTCTCTATA 

TTTATTTATAATGCCAGCAAGCCTCAAGTTCCGGAATACGCACCTAGGCAAGAAAGGATCCG 

AGATCTAAGTGGCAATCTTTGGGAGCGTTCCAGTGGGGATGGAGAAGAACTAGAAAGACTTA 

CCAAACCCAAGAGTGATGAGTCAGATGAAGATACTTTCTAACTGGTACCCACATAGTTTGCA 

GCTCTCTTGAACCTTATTTTCACATTTTCAGTGTTTGTAATATTTATCTTTTCACTTTGATA 

AACCAGAAATGTTTCTAAATCCTAATATTCTTTGCATATATCTAGCTACTCCCTAAATGGTT 

CCATCCAAGGCTTAGAGTACCCAAAGGCTAAGAAATTCTAAAGAACTGATACAGGAGTAACA 

ATATGAAGAATTCATTAATATCTCAGTACTTGATAAATCAGAAAGTTATATGTGCAGATTAT 

TTTCCTTGGCCTTCAAGCTTCCAAAAAACTTGTAATAATCATGTTAGCTATAGCTTGTATAT 

ACACATAGAGATCAATTTGCCAAATATTCACAATCATGTAGTTCTAGTTTACATGCCAAAGT 

CTTCCCTTTTTAACATTATAAAAGCTAGGTTGTCTCTTGAATTTTGAGGCCCTAGAGATAGT 

CATTTTGCAAGTAAAGAGCAACGGGACCCTTTCTAAAAACGTTGGTTGAAGGACCTAAATAC 

CTGGCCATACCATAGATTTGGGATGATGTAGTCTGTGCTAAATATTTTGCTGAAGAAGCAGT 

TTCTCAGACACAACATCTCAGAATTTTAATTTTTAGAAATTCATGGGAAATTGGATTTTTGT 

AATAATCTTTTGATGTTTTAAACATTGGTTCCCTAGTCACCATAGTTACCACTTGTATTTTA 

AGTCATTTAAACAAGCCACGGTGGGGCTTTTTTCTCCTCAGTTTGAGGAGAAAAATCTTGAT 

GTCATTACTCCTGAATTATTACATTTTGGAGAATAAGAGGGCATTTTATTTTATTAGTTACT 

AATTCAAGCTGTGACTATTGTATATCTTTCCAAGAGTTGAAATGCTGGCTTCAGAATCATAC 

CAGATTGTCAGTGAAGCTGATGCCTAGGAACTTTTAAAGGGATCCTTTCAAAAGGATCACTT 

AGCAAACACATGTTGACTTTTAACTGATGTATGAATATTAATACTCTAAAAATAGAAAGACC 

AGTAATATATAAGTCACTTTACAGTGCTACTTCACACTTAAAAGTGCATGGTATTTTTCATG 

GTATTTTGCATGCAGCCAGTTAACTCTCGTAGATAGAGAAGTCAGGTGATAGATGATATTAA 

AAATTAGCAAACAAAAGTGACTTGCTCAGGGTCATGCAGCTGGGTGATGATAGAAGAGTGGG 

CTTTAACTGGCAGGCCTGTATGTTTAC7VGACTACCATACTGTAAATATGAGCTTTATGGTGT 

CATTCTCAGAAACTTATACATTTCTGCTCTCCTTTCTCCTAAGTTTCATGCAGATGAATATA 

AGGTAATATACTATTATATAATTCATTTGTGATATCCACAATAATATGACTGGCAAGAATTG 

GTGGAAATTTGTAATTAAAATAATTATTAAACCT 



wo 99/63088 



FIGURE 9 



PCT/US99/I2252 



MEKQCCSHPVICSLSTMYTFLLGAIFIALSSSRILLVKYSANEENKYDYLPTTVNVCSELVK 

LVFCVLVSFCVIKKDHQSRNIJCYASWKEFSDFMKWSIPAFLYFLDNLIVFYVLSYLQPAMAV 

IFSNFSIITTALLFRIVLKRRLNWIQWASLLTLFLSIVALTAGTKTLQHNLAGRGFHHDAFF 

SPSNSCLLPRSECPRKDNCTAKEWTFPEAKWNTTARVFSHIRLGMGHVLIIVQCFISSMANI 

YNEKILKEGNQLTESIFIQNSKLYFFGILFNGLTLGLQRSNRDQIKNCGFFYGHSAFSVALI 

FVTAFQGLSVAFILKFIJDNMFHVLMAQVTTVIITTVSVLVFDFRPSLEFFLEAPSVLLSIFI 

YNASKPQVPEYAPRQERIRDLSGNLWERSSGDGEELERLTKPKSDESDEDTF 
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FIGURE 10 

CGTGCCTGCGCAATGGGTGTCGGGTCCGCTTTTTCCCAATCCGGACGTAATCGTGGTTTTTG 

TTCTGCAATAGGCGGCTTAGAGGGAGGGGCTTTTTCGCCTATACCTACTGTAGCTTCTCC^ 

GTATGGACCCTAAAGGCTACTGCTGCTACTACGGGGCTAGACAGTTACTGTCTCAGCTCTAG 

GATGTGCGTTCTTCO^CTAGAAGCTCTTCTGAGGGAGGTAATTAAAAAACAGTGGAATGGAA 

AAACAGTGCTGTAGTCATCCTGTAATATGCTCCTTGTCAACAATGTATACATTCCTGCTAGG 

TGCCATATTCATTGCTTTAAGCTCAAGTCGCATCTTACTAGTGAAGTATTCTGCCAATGAAG 

AAAACAAGTATGATTATCTTCCAACTACTGTGAATGTGTGCTCAGAACTGGTGAAGCTAGTT 

TTCTGTGTGCTTGTGTCATTCTGTGTTATAAAGAAAGATCATCAAAGTAGAAATTTGAAATA 

TGCTTCCTGGAAGGAATTCTCTGATTTCATGAAGTGGTCCATTCCTGCCTTTCTTTATTTCC 

TGGATAACTTGATTGTCTTCTATGTCCTGTCCTATCTTCAACCAGCCATGGCTGTTATCTTC 

TCAAATTTTAGCATTATAACAACAGCTCTTCTATTCAGGATAGTGCTGAAGAGGCGTCTAAA 

CTGGATCCyVGTGGGCTTCCCTCCTGACTTTATTTTTGTCTATTGTGGCCTTGACTGCCGGGA 

CTAAAACTTTA 
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FIGURE 11 

CGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGGCCGGCTTGGCTAGCGCGCGGCGGCC 
GTGGCTAAGGCTGCTACGAAGCGAGCTTGGGAGGAGCAGCGGCCTGCGGGGCAGAGGAGCAT 
CCCGTCTACCAGGTCCCAAGCGGCGTGGCCCGCGGGTCATGGCCAAAGGAGAAGGCGCCGAG 
AGCGGCTCCGCGGCGGGGCTGCTACCCACCAGCATCCTCCAAAGCACTGAACGCCCGGCCCA 
GGTGAAGAAAGAACCGAAAAAGAAGAAACAACAGTTGTCTGTTTGCAACAAGCTTTGCTATG 
CACTTGGGGGAGCCCCCTACCAGGTGACGGGCTGTGCCCTGGGTTTCTTCCTTCAGATCTAC 
CTATTGGATCTGGCTCAGGTGGGCCCTTTCTCTGCCTCCATCATCCTGTTTGTGGGGCGAGC 
CTGGGATGCCATCACAGACCCCCTGGTGGGCCTCTGCATCAGa^TCCCCCrGGACCTGCC 
TGGGTCGCCTTATGCCCTGGATCATCTTCTCCACGCCCCTGGCCGTCATTGCCTACTTCCTC 
ATCTGGTTCGTGCCeGACTTCCCACACGGCCAGACCTATTGGTACCTGCTTTTCTATTGCCT 
CTTTGAAACAATGGTCACGTGTTTCCATGTTCCCTACTCGGCTCTCACCATGTTCATCAGCA 
ACCGAGCAGACTGAGCGGGATTCTGCCACCGCCTATCGGATGACTGTGGAAGTGCTGGGCAC 
AGTGCTGGGCACGGCGATCCAGGGACAAATCGTGGGCCAAGCAGACACGCCTTGTTTCCAGG 
ACTTCAATAGCTCTACAGTAGCTTCACAAAGTGCCAACCATACACATGGCACCACTTCACAC 
AGGGAAACGCAAAAGGCATACCTGCTGGCAGCGGGGGTCATTGTCTGTATCTATATAATCTG 
TGCTGTCATCCTGATCCTGGGCGTGCGGGAGCAGAGAGAACCCTATGAAGCCCAGCAGTCTG 
AGCCAATCGCCTACTTCCGGGGCCTACGGCTGGTCATGAGCCACGGCCCATACATCAAACTT 
ATTACTGGCTTCCTCTTCACCTCCTTGGCTTTCATGCTGGTGGAGGGGAACTTTGTCTTGTT 
TTGCACCTACACCTTGGGCTTCCGCAATGAATTCCAGAATCTACTCCTGGCCATCATGCTCT 
CGGCCACTTTAACCATTCCCATCTGGCAGTGGTTCTTGACCCGGTTTGGCAAGAAGACAGCT 
GTATATGTTGGGATCTCATCAGCAGTGCCATTTCTCATCTTGGTGGCCCTCATGGAGAGTAA 
CCTCATCATTACATATGCGGTAGCTGTGGCAGCTGGCATCAGTGTGGCAGCTGCCTTCTTAC 
TACCCTGGTCCATGCTGCCTGATGTCATTGACGACTTCCATCTGAAGCAGCCCCACTTCCAT 
GGAACCGAGCCCATCTTCTTCTCCTTCTATGTCTTCTTCACCAAGTTTGCCTCTGGAGTGTC 
ACTGGGCATTTCTACCCTCAGTCTGGACTTTGCAGGGTACCAGACCCGTGGCTGCTCGCAGC 
CGGAACGTGTCAAGTTTACACTGAACATGCTCGTGACCATGGCTCCCATAGTTCTCATCCTG 
CTGGGCCTGCTGCTCTTCAAAATGTACCCCATTGATGAGGAGAGGCGGCGGCAGAATAAGAA 
GGCCCtGCAGGCACTGAGGGACGAGGCCAGCAGCTCTGGCTGCTCAGAAACAGACTCCACAG 
AGCTGGCTAGCATCCTCTAGGGCCCGCCACGTTGCCCGAAGCCACCATGCAGAAGGCCACAG 
AAGGGATCAGGACCTGTCTGCGGGCTTGCTGAGCAGCTGGACTGCAGGTGCTAGGAAGGGAA 
CTGAAGACTCAAGGAGGTGGCCCAGGACACTTGCTGTGCTCACTGTGGGGCCGGCTGCTCTG 
TGGCCTCCTGCCTCCCCTCTGCCTGCCTGTGGGGCCAAGCCCTGGGGCTGCCACTGTGAATA 
TGCCAAGGACTGATCGGGCCTAGCCCGGAACACTAATGTAGAAACCTTTTTTTTACAGAGCC 
TAATTAATAACTTAATGACTGTGTACATAGCAATGTGTGTGTATGTATATGTCTGTGAGCTA 
TTAATGTTATTAATTTTCATAAAAGCTGGAAAGC 
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FTGIJRE 12 

MWLRWALSLPPSSCLWAEPGMPSQTPWWASASANPPGPAWVALCPGSSSPRPWPSLPTSSSG 
SCPTSHTARPIGTCFSIASLKQWSRVSMFPTRLSPCSSATEQTERDSATAYRMTVEVLGTVL 
GTAIQGQIVGQADTPCFQDFNSSTVASQSANHTHGTTSHRETQKAYLLAAGVIVCIYIICAV 
ILILGVREQREPYEAQQSEPIAYFRGLRLVMSHGPYIKLITGFLFTSLAFMLVEGNFVLFCT 
YTLGFRNEFQNLLLAIMLSATLTIPIWQWFLTRFGKKTAVYVGISSAVPFLILVALMESNLI 
ITYAVAVAAGISVAAAFLLPWSMLPDVIDDFHLKQPHFHGTEPIFFSFYVFFTKFASGVSLG 
ISTLSLDFAGYQTRGCSQPERVKFTLNMLVTMAPIVLILLGLLLFKMYPIDEERRRQNKKAL 
QALRDEASSSGCSETDSTELASIL 
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FIGURE 13 

GGGAAACGCAAAAGGCATACCTGCTGGCAGCGGGGGTCATTGTCTGTATCTATATAATCTGT 

GCTGTCATCCTGATCCTGGGCGTGCGGGAGCAGAGAGAACCCTATGAAGCCCAGCA^^ 

GCCAATCGCCTACTTCCGGGGCCTACGGCTGGTCATGAGCCACGGCCCATACATCAAAC 

TTACTGGCTTCCTCTTCACCTCCTTGGCTTTCATGCTGGTGGAGGGGAACTTO 

TGCACCTACACCTTGGGCTTCCGCAATGAATTCCAGAATCTACTCCTGGCCATCATGCTCTC 

GGCCACTTTAACCATTCCCATCTGGCAGTGGTTCTTGACCCGGTTTGGCAAGAAGACAGCTG 

TATATGTTGGGATCTCATCAGCAGTGCCATTTCTCATCTTGGTGGCCCTCATGGAGAGTAAC 

CTCATCATTACATATGCGGTAGCTGTGGCAGCTGGCATCAGTGTGGCAGCTGCCTTCTTACT 

ACCCTGGTCCATGCTGCCTGATGTCATTGACGACTTCCATCTGAAGCAGCCCCACTTCCATG 

GAACCGAGCCCAT 
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FIGURE 14 

GGGGCTTCGGCGCCy^GCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGT 
ATGAGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAA&TC 
GGTGGTTTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCrTGTAATTTGGAC^ 
TTO^TATTTTCy^TACATTACTGCAGTAACACTCCACC^ 

CAGTGACACTGGTACAGTAGCTCCAGAAAAATGCTTATTTGGGGCaATGCTAAATATO 

CAGTTTTATGCATTGCTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCT 

GAGAACGTTATCATCAAATTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGG 

ACTTTCTATTGTGGCAAACTTCCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTG 

TGCTTACCTTTGGTATGGGCTCATTATATATGTTTGTTCAGACCATCCITTCCTACa^^ 

CAGCCCAAAATCCATGGCAAACAAGTCTTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGG 

AGTAAGTGCACTTAGCATGCTGACriTGCTCATCAGTTTTGCACAGTGGCAA 

ATTTAGAACAGAAACTCCATTGGAACCCCGAGGACAAAGGTTATGTGCTTCACATGATCACT 

ACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTTGGTTTTTTCCTGACTTACATTCGTGA 

TTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACATGGATTAACCCTCTATGACACTG 

CACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCAGAGATATTTGATGAAAGGAT 

AAAATATTTCTGTAATGATTATGATTCTCAGGGATTGGGGAAAGGTTCACAGAAGTTGCTTA 

TTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACTGATGAATGCTGATA 

ATCAGGAAACATGAAAGAAGCCATTTGATAGATTATTCTAAAGGATATCATCAAGAAGACTA 

TTAAAAACACCTATGCCTATACTTTTTTATCTCAGAAAATAAAGTCTy^ 
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FIGURE 15 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNI 
AAVLCIATIYVRYKQVHALSPEENVIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSG 
AVLTFGMGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFG 
TDLEQKLHWNPEDKGYVLHMITTAAEWSMSFSFPGFFLTYIRDFQKISLRVEANLHGLTLYD 
TAPCPINNERTRLLSRDI 
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FIGURE 16 

CGGACGCTTGGGCNGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGTGCCTGATGCCGAGT 

TCCGTCrcrCGGGTCTTTTCCTGGTCCCAGGCAAAGCGGAGCGGAGATCCTa^ 

GTGCTTCGCGCTTCCGGAGAAAATCAGCGGTCTAATTAATTCCTCTGGTTTGTTGTAGC^ 

TACCAAGAATCTTCAACCCTTTCCCACAAAAGCTAATTGAGTACACGTTCCTGT^ 

CGTTCCTGTTGATTTACAAAAGGTGCAGGTATGAGCAGGTCrGAAGACTAAO^T^^ 

GTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGTTTCAGCAAGGCCTCAGTTTCCTTCCT 

TCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTTTCATACATTACTGCAGTAACAC^ 

CCACCATATAGACCCGGCTTTACCTTATATCAGTGACACTGGTACAGTANC 
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FIGURE 17 

CCCACGCGTCCGCCCGCCGCTGCGTCCCGGAGTGCAAGTGAGCTTCTCGGCTGCCCCGCGGG 
CCGGGGTGCGGAGCCGACMGCGCCCGCTTCrCGGCCTCCTTCTGGTCTTCGCCGGCTGCAC 
CTTCGCCITGTACTTGCTGTCGACGCGACTGCCCCGCGGGCGGAGACTGGGCTCCACCGAGG 
AGGCTGGAGGCAGGTCGCTGTGGTTCCCCTCCGACCTGGCAGAGCTGCGGGAGCTCTCTGAG 
GTCCTTCGAGAGTACCGGAAGGAGCACCAGGCCTACGTGTTCCTGCTCTTCTGCGGCGCCTA 
CCTOTACAAACAGGGCTTTGCCATCCCCGGCTCCAGCTTCCTGAATGTTTTAGCTGGTC 

tgtttgggccatggctggggcttctgctgtgctgtgtgttgacctcggtgggtgccacatgc 
tgctacctgctctccagtatttttggcaaacagttggtggtgtcctactttcctgataaagt 
ggccctgctgcagagaaaggtggaggagaacagaaacagcttgttttttttcttattgtttt 
tgagacttttccccatgacaccaaactggttcttgaacctctcggccccaattctgaacatt 
cccatcgtgcagttcttcttctcagttcttatcggtttgatcccatataatttcatctgtgt 
gcagacagggtccatcctgtcaaccctaacctctctggatgctcttttctcctgggacactg 
tctttaagctgttggccattgccatggtggcattaattcctggaaccctcattaaaaaattt 
agtcagaaacatctgo^ttgaatgaaacaagtactgctaatcatatacacagtag;^^ 

CACATGATCTGGATTTTCTGTTTGCCACATCCCTGGACTCAGTTGCTTATTTGTGTAATGGA 

tgtggtcctctaaagcccctcattgtttttgattgccttctataggtgatgtggacactgtg 
catcaatgtgcagtgtcttttcagaaaggacactctgctcttgaaggtgtattacatcaggt 
tttcaaaccagccctggtgtagcagacactgcaacagatgcctcctagaaaatgctgtttgt 
ggccgggcgcggtggctcacgcctgtaatcccagcactttgggaggccgaggccggtgattc 
acaaggtcaggagttcaagaccagcctggccaagatggtgaaatcctgtctctaataaaaat 
acaaaaattagccaggcgtggtggcaggcacctgtaatcccagctactcgggaggctgaggc 
aggagaattgcttgaaccaaggtggcagaggttgcagtaagccaagatcacaccactgcact 

CCAGCCTGGGTGATAGAGTGAGACACTGTCTTGAC 
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FIGURE 18 

MRPLLGLIJ^VFAGCTFALYLLSTRLPRGRRLGSTEEAGGRSLWFPSDIJ^LRELSEVLR^ 

KEHQAYVFLLFCGAYLYKQGFAIPGSSFLNVLAGALFGPWIXSI^LCC^ 

I FGKQLWSYFPDKVALLQRKVEEOTWSLFFFLLFLRLFPMTPNWFLNLSAP ILNI PI V 

FSVLIGLIPYNFICVQTGSILSTLTSIJ)ALFSWDTVFKLLAIAMVALIPGTLIKKFSQK^ 
LNETSTANHIHSRKDT 
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FIGURE 19 

CCGAGGCGGGAGGAGCCCGAGGGGGCGCGAGCCCCGCT^TGAATCATTGTAGTCAATCATTTT 

CCAGTTCTCAGCCGCTCAGTTGTGATCAAGGGACACGTGGTTTCCGAACTGCCA^ 

TAGGA^^TAACTTGGGATTTTATATTGGAAGACATGGATCTO 

TTTATGACAAACTTTCAGAGACTGTTGATTTGGTGAGACAGACCGGCCATC^ 

TCAGAGAAGGCAATTGAAAAATTTATCAGAOVGCTGCTGGAAAAGAATGAACCTCAG^ 

CCCCCCGCAGTATCCTCTCCTTATAGTTGTGTATAAGGTTCrCGCAACCTTGGGATTAATCT 

TGCrCACTGCCTACTTTGTGATTC?UVCCTTTCAGCCCATTAGCACCTGAGCCAGTGCTT^ 

GGAGCTCACACCTGGCGCTCACTCATCCATCACATTAGGCTGATGTCCTTGCCCATTGCC^ 

GAAGTACATGTCAGAAAATAAGGGAGTTCCTCTGCATGGGGGTGATGAAGACAGACCCTTTC 

CAGACTTTGACCCCTGGTGGACAAACGACTGTGAGCAGAATGAGTCAGAGCCCATTCCTGCC 

AACTGCACTGGCTGTGCCCAGAAACACCTGAAGGTGATGCTCCTGGAAGACGCCCCAAGGAA 

ATTTGAGAGGCTCCATCCACTGGTGATCAAGACGGGAAAGCCCCTGTTGGAGGAAGAGATTC 

AGCATTTTTTGTGCCAGTACCCTGAGGCGACAGAAGGCTTCTCTGAAGGGTTTTTCGCC^ 

TGGTGGCGCTGCTTTCCTGAGCGGTGGTTCCCATTTCCTTATCCATGGAGGAGACCTCTGAA 

CAGATCACAAATGTTACGTGAGCTTTTTCCTGTTTTCACTCACCTGCCATTTCCAAAAGATG 

CCrCTTTAAACAAGTGCTCCTTTCTTCACCCAGAACCTGTTGTGGGGAGTAAGATGCATAAG 

ATGCCTGACCTATTTATCATTGGCAGCGGTGAGGCCATGTTGCAGCTCATCCCTCCCTTCCA 

GTGCCGAAGACATTGTCAGTCTGTGGCCATGCCAATAGAGCCAGGGGATATCGGCTATGTCG 

ACACCACCCACTGGAAGGTCTACGTTATAGCCAGAGGGGTCCAGCCTTTGGTCATCTGCGAT 

GGAACCGCTTTCTCAGAACTGTAGGAAATAGAACrGTGCACAGGAACAGCTTCCAGAGCCGA 

AAACCAGGTTGAAAGGGGAAAAATAAAAACAAAAACGATGAAACTGCAAAAA 
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FIGURE 20 

MDLAANEISIYDKLSETVDLVRQTGHQCGMSEKAIEKFIRQLLEKNEPQRPPPQYPLLIVVY 
KVLATI<GLILLTAYFVIQPFSPIJa>EPVLSGAHTWRSLIHHIRLMSLPIAKKYMSENKGVPL 
HGGDEDRPFPDFDPWWTNDCEQNESEPIPANCTGCAQKHLKVNILLEDAPRKFERLHPLVIKT 
GKPLLEEEIQHFLCQYPEATEGFSEGFFAKWWRCFPERWFPFPyPWRRPLNRSQMLRELFPV 
FTHLPFPKDASLNKCSFLHPEPWGSKMHKMPDLFIIGSGEAMLQLIPPFQCRRHCQSVAMP 
lEPGDIGYVDTTHWKVYVIARGVQPLVICDGTAFSEL 
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FIGURE 21 

CCACGGTGTCCGTTCTTCGCCCGGCGGCAGCTGTCCCCGAGGCGGGAGGAGCCCGAGGGGCG 
CGAGCCCCGCATGAATCATTGTAGTCAATCATTTTCCAGTTCTCAGCCGTTCAGTTGTGATC 
AAGGGACACGTGGTTTCCGAACTGCCAGCTCAGAATAGGAAAATAACTTGGGATTTTATATT 
GGAAGACATGGATCTTGCTGCCAACGAGATCAGCATTTATGACAAACTTTCAGAGACTGTT^ 
ATTTGGTGAGACAGACCGGCCATCAGTGTGGCATGTCAGAGAAGGCAATTGAAAAATTTATC 
AGACAGCTGCTGGAAAAGAATGAACCTCAGAGACCCCCCCCGCAGTATCCTCTCCTTATAGT 
TGTGTATAAGGTTCTCGCAACCTTGGGATTAATCTTGCTCACTGCCTACTTTGTGATTCAAC 
CTTTCAGCCCATTAGCACCTGAGCCAGTGCTTTGTGGAGCTCAC 
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FIGURE 22 

CCCACGCGTCCGCCCACGCGTCCGGCTGAACACCTCTTCTTTGGAGTCAGCCACTGATGAGG 
CAGGGTCCCCACTTGCAGCTGCAGCAGCTGCAGCAGCTGCAGAGCGCTGCTCCTGGCTGGTG 
CCACTGGTGCGCACGCTGCTAGACCGTGCCTATGAGCCGCTGGGGCTGCAGTGGGGACTGCC 
CTCCCTGCCACCCACCAATGGCAGCCCCACCTTCTTTGAAGACTTCCAGGCTTTTTGTGCCA 
CACCCGAATGGCGCCACTTCATCGACAAACAGGTACAGCCAACCATGTCCCAGTTCGAAATG 
GACACGTATGCTAAGAGCCACGACCTTATGTCAGGTTTCTGGAATGCCTGCTATGACATGCT 
TATGAGCAGTGGGCAGCGGCGCCAGTGGGAGCGCGCCCAGAGTCGTCGGGCCTTCCAGGAGC 
TGGTGCTGGAACCTGCGCAGAGGCGGGCGCGCCTGGAGGGGCTACGCTACACGGCAGTGCTG 
AAGCAGCAGGCAACGCAGCACTCCATGGCCCTGCTGCACTGGGGGGCGCTGTGGCGCCAGCT 
CGCCAGCCCATGTGGGGCCTGGGCGCTGAGGGACACTCCCATCCCCCGCTGGAAACTGTCCA 
GCGCCGAGACATATTCACGCATGCGTCTGAAGCTGGTGCCCAACCATCACTTCGACCCTCAC 
CTGGAAGCCAGCGCTCTCCGAGACAATCTGGGTGAGGTTCCCCTGACACCCACCGAGGAGGC 
CTCACTGCCTCTGGCAGTGACCAAAGAGGCCAAAGTGAGCACCCCACCCGAGTTGCTGCAGG 
AGGACCAGCTCGGCGAGGACGAGCTGGCTGAGCTGGAGACCCCGATGGAGGCAGCAGAACTG 
GATGAGCAGCGTGAGAAGCTGGTGCTGTCGGCCGAGTGCCAGCTGGTGACGGTAGTGGCCGT 
GGTCCCAGGGCTGCTGGAGGTCACCACACAGAATGTATACTTCTACGATGGCAGCACTGAGC 
GCGTGGAAACCGAGGAGGGCATCGGCTATGATTTCCGGCGCCCACTGGCCCAGCTGCGTGAG 
GTCCACCTGCGGCGTTTCAACCTGCGCCGTTCAGCACTTGAGCTCTTCTTTATCGATCAGGC 
CAACTACTTCCTCAACTTCCCATGCAAGGTGGGCACGACCCCAGTCTCATCTCCTAGCCAGA 
CTCCGAGACCCCAGCCTGGCCCCATCCCACCCCATACCCAGGTACGGAACCAGGTGTACTCG 
TGGCTCCTGCGCCTACGGCCCCCCTCTCAAGGCTACCTAAGCAGCCGCTCCCCCCAGGAGAT 
GCTGCGTGCCTCAGGCCTTACCCAGAAATGGGTACAGCGTGAGATATCCAACTTCGAGTACT 
TGATGCAACTCAACACCATTGCGGGGCGGACCTACAATGACCTGTCTCAGTACCCTGTGTTC 
CCCTGGGTCCTGCAGGACTACGTGTCCCCAACCCTGGACCTCAGCAACCCAGCCGTCTTCCG 
GGACCTGTCTAAGCCCATCGGTGTGGTGAACCCCAAGCATGCCCAGCTCGTGAGGGAGAAGT 
ATGAAAGCTTTGAGGACCCAGCAGGGACCATTGACAAGTTGCACTATGGCACCCACTACTCC 
AATGCAGCAGGCGTGATGCACTACCTCATCCGCGTGGAGCCCTTCACCTCCCTGCACGTCCA 
GCTGCAAAGTGGCCGCTTTGACTGCTCCGACCGGCAGTTCCACTCGGTGGCGGCAGCCTGGC 
AGGCACGCCTGGAGAGCCCTGCCGATGTGAAGGAGCTCATCCCGGAATTCTTCTACTTTCCT 
GACTTCCTGGAGAACCAGAACGGTTTTGACCTGGGCTGTCTCCAGCTGACCAACGAGAAGGT 
AGGCGATGTGGTGCTACCCCCGTGGGCCAGCTCTCCTGAGGACTTCATCCAGCAGCACCGCC 
AGGCTCTGGAGTCGGAGTATGTGTCTGCACACCTACACGAGTGGATCGACCTCATCTTTGGC 
TACAAGCAGCGGGGGCCAGCCGCCGAGGAGGCCCTCAATGTCTTCTATTACTGCACCTATGA 
GGGGGCTGTAGACCTGGACCATGTGACAGATGAGCGGGAACGGAAGGCTCTGGAGGGCATTA 
TCAGCAACTTTGGGCAGACTCCCTGTCAGCTGCTGAAGGAGCCACATCCAACTCGGCTCTCA 
GCTGAGGAAGCAGCCCATCGCCTTGCACGCCTGGACACTAACTCACCTAGCATCTTCCAGCA 
CCTGGACGAACTCAAGGCATTCTTCGCAGAGGTGACTGTGAGTGCCAGTGGGCTGCTGGGCA 
CCCACAGCTGGTTGCCCTATGACCGCAACATAAGCAACTACTTCAGCTTCAGCAAAGACCCC 
ACCATGGGCAGCCACAAGACGCAGCGACTGCTGAGTGGCCCGTGGGTGCCAGGCAGTGGTGT 
GAGTGGACAAGCACTGGCAGTGGCCCCGGATGGAAAGCTGCTATTCAGCGGTGGCCACTGGG 
ATGGCAGCCTGCGGGTGACTGCACTACCCCGTGGCAAGCTGTTGAGCCAGCTCAGCTGCCAC 
CTTGATGTAGTAACCTGCCTTGCACTGGACACCTGTGGCATCTACCTCATCTCAGGCTCCCG 
GGACACCACGTGCATGGTGTGGCGGCTCCTGCATCAGGGTGGTCTGTCAGTAGGCCTGGCAC 
CAAAGCCTGTGCAGGTCCTGTATGGGCATGGGGCTGCAGTGAGCTGTGTGGCCATCAGCACT 
GAACTTGACATGGCTGTGTCTGGATCTGAGGATGGAACTGTGATCATACACACTGTACGCCG 
CGGACAGTTTGTAGCGGCACTACGGCCTCTGGGTGCCACATTCCCTGGACCTATTTTCCACC 
TGGCATTGGGGTCCGAAGGCCAGATTGTGGTACAGAGCTCAGCGTGGGAACGTCCTGGGGCC 
CAGGTCACCTACTCCTTGCACCTGTATTCAGTCAATGGGAAGTTGCGGGCTTCACTGCCCCT 
GGCAGAGCAGCCTACAGCCCTGACGGTGACAGAGGACTTTGTGTTGCTGGGCACCGCCCAGT 
GCGCCCTGCACATCCTCCAACTAAACACACTGCTCCCGGCCGCGCCTCCCTTGCCCATGAAG 
GTGGCCATCCGCAGCGTGGCCGTGACCAAGGAGCGCAGCCACGTGCTGGTGGGCCTGGAGGA 
TGGCAAGCTCATCGTGGTGGTCGCGGGGCAGCCCTCTGAGGTGCGCAGCAGCCAGTTCGCGC 
GGAAGCTGTGGCGGTCCTCGCGGCGCATCTCCCAGGTGTCCTCGGGAGAGACGGAATACAAC 
CCTACTGAGGCGCGCTGAACCTGGCCAGTCCGGCTGCTCGGGCCCCGCCCCCGGCAGGCCTG 
GCCCGGGAGGCCCCGCCCAGAAGTCGGCGGGAACACCCCGGGGTGGGCAGCCCAGGGGGTGA 
GCGGGGCCCACCCTGCCCAGCTCAGGGATTGGCGGGCGATGTTACCCCCTCAGGGATTGGCG 
GGCGGAAGTCCCGCCCCTCGCCGGCTGAGGGGCCGCCCTGAGGGCCAGCACTGGCGTCT 
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FIGURE 23 

MSQFEMDTYAKSHDLMSGFWNACYDMIJyiSSGQRRQWERAQSRRAFQELVLEP^^ 

RYTAVLKQQATQHSMALLHWGALWRQI^PCGAWALRDTPIPRWKLSSAETYSR^^ 

HHFDPHLEASALRDNLGEVPLTPTEEASLPLAVTKEAKVSTPPELLQEDQLGEDELA^ 

MEAAEUDEQREKLVLSAECQLVTWAVVPGLLEVTTQNVYFYDGSTERVETEEGIGyDFRRP 

lAQLREVHLRRFl^RRSALELFFIDQANYFLNFPCKVGTTPVSSPSQTPRPQPGPIPPHTQV 

RNQVYSWLLRLRPPSQGYLSSRSPQEMLRASGLTQKWQREISNFEYLMQl^IAGRTYNDL 

SQYPWPWVLQDYVSPTLDLSNPAVFRDLSKPIGVVNPKHAQLVREKYESFEDPAGTIDKFH 

YGTHYSNAAGVMHYLIRVEPFTSLHVQLQSGRFDCSDRQFHSVAAAWQARLESPADVKELIP 

EFFYFPDFLENQNGFDLGCLQLTNEKVGDWLPPWASSPEDFIQQHRQALESEYVSAHLHEW 

IDLI FGYKQRGPAAEEAIJ^FYYCrrYEGAVDLDHWDERERKALEGI I SNFGQTPCQLLK^ 

HPTRLSAEEAAHRLARLDTNSPS I FQHLDELKAFFAEVTVSASGLLGTHSWLPYDRNI SNYF 

SFSKDPTMGSHKTQRLLSGPWVPGSGVSGQALAVAPDGKLLFSGGHWDGSLRVTALPRGKLL 

SQLSCHLDWTCLALDTCG I YL I SGSRDTTCM VWRLLHQGGLSVGLAPKPVQVLYGHGAAVS 

CVAI STEIJDMAVSGSEDGTVI I HTVRRGQFVAALRPLGATFPGP I FHIJUiGSEGQI WQSSA 

WERPGAQVTYSLHLYSVNGKLRASLPI^QPTALTVTEDFVLLGTAQCALHILQI^ 

PPLPMKVAIRSVAVTKERSHVLVGLEDGKLIVWAGQPSEVRSSQFARKLWRSSRRISQVSS 
GETEYNPTEAR 
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FIGURE 24 

CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCC 

CACGGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCAT 

CCAAAGGCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTC 

TGGACCCTTAACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTT 

CrACTGGGCCTTCCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCC 

GCACACTCCGTTACCACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAG 

ATAGCCCGGGTCATCTTGGAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGC 

CCGCTGCATCATGTGCTGTTTCAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCC 

TAAACCGCAATGCATACATCATGATCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAA 

AATGCGTTCATGCTACTCATGCGAAACATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGA 

CCTGCTGCTGTTCTTTGGGAAGCTGCTGGTGGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTT 

TTTTCTCCGGTCGCATeCCGGGGCTGGGTAAAGACTTTAAGAGCCCCCACCTCAACTATTAC 

TGGCTGCCCATCATGACCTCCATCCTGGGGGCCTATGTCATCGCCAGCGGCTTCTTCAGCGT 

TTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCTGGAAGACCTGGAGCGGAACAACG 

GCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAAAGATTCTGGGCAAGAAGAAC 

GAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGGCCCTGATCCAGGACTGC 

ACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGTCTCCATTTTGTGGT 

AAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACTTTGAGAGGCTG 

AGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCTCC 

GTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCAGGTAC 

TCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGAGA 

TCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAA 

AAAGATTTTATTAAAGATATTTTGTTAACTC 
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FIGIJRK 2j> 

RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSS KGLI QRSVFNLQI YGVLGLF 

WTLNWVLALGQCVLAGAFASFYWAFHKPQD I PTFPL I S AF IRTLRYHTGSLAFGALILTLVQ 

lARVILEYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAK 

NAFMLLMRNIWVVVLDKVTDLLLFFGKIiWGGVGVLSFFPFSGRIPGIXSKDFKSPHLNYY 

WLPIMTSILGAYVIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKN 
EAPPDNKKRKK 
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FIGURE 26 

GAGTCTTGACCGCCGCCGGGCTCTTGGTACCTCAGCGCGAGCGCCAGGCGTCCGGCCGCCGT 

GGCTaa^TTCGTGTCCGATTTCCGCAAAGAGTTCTACGAGGTGGTCCAGAGCCAGAGGGTCC 

TTCTCTTCGTGGCCTCGGACGTGGATGCTCTGTGTGCGTGCAAGATCCTTCAGGCCTTGTTC 

CAGTGTGACCACGTGO^TATACGCTGGTTCCAGTTTCTGGGTGGCAAGAACTTGAAACT^ 

ATTTCTTGAGCATAAAGAACAGTTTCATTATTTTATTCTCATAAACTGTGGAGCTAATGTAG 

ACCTATTGGATATTCTTCAACCTGATGAAGACACTATATTCTTTGTGTGTGACTCCCATAGG 

CCAGTCAATGTCGTCAATGTATACAACGATACCCAGATCAAATTACTCATTAAACAAGATGA 

TGACCTTGAAGTTCCCGCCTATGAAGACATCTTCAGGGATGAAGAGGAGGATGAAGAGCATT 

CAGGAAATGACAGTGATGGGTCAGAGCCTTCTGAGAAGCGCACACGGTTAGAAGAGGAGATA 

GTGGAGCAAACCATGCGGAGGAGGCAGCGGCGAGAGTGGGAGGCCCGGAGAAGAGACATCCT 

CTTTGACTACGAGCAGTATGAATATCATGGGACATCGTCAGCCATGGTGATGTTTGAGCTGG 

CTTGGATGCTGTCCAAGGACCTGAATGACATGCTGTGGTGGGCCATCGTTGGACTAACAGAC 

CAGTGGGTGCAAGACAAGATCACTCAAATGAAATACGTGACTGATGTTGGTGTCCTGCAGCG 

CCACGTTTCCCGCCACAACCACCGGAACGAGGATGAGGAGAACACACTCTCCGTGGACTGCA 

CACGGATCTCCTTTGAGTATGACCTCCGCCTGGTGCTCTACCAGCACTGGTCCCTCCATGAC 

AGCCTGTGCAACACCAGCTATACCGCAGCCAGGTTCAAGCTGTGGTCTGTGCATGGACAGAA 

GCGGCTCCAGGAGTTCCTTGCAGACATGGGTCTTCCCCTGAAGCAGGTGAAGCAGAAGTTCC 

AGGCCATGGACATCTCCTTGAAGGAGAATTTGCGGGAAATGATTGAAGAGTCTGCAAATAAA 

TTTGGGATGAAGGACATGCGCGTGCAGACTTTCAGCATTCATTTTGGGTTCAAGCACAAGTT 

TCTGGCCAGCGACGTGGTCTTTGCCACCATGTCTTTGATGGAGAGCCCCGAGAAGGATGGCT 

CAGGGACAGATCACTTCATCCAGGCTCTGGACAGCCTCTCCAGGAGTT^CCTGGACAAGCTG 

TACCATGGCCTGGAACrCGCCAAGAAGCAGCTGCGAGCCACCCAGCAGACCATTGCCAGCT^ 

CTTTGCACCAACCTCGTCATCTCCCAGGGGCCTTTCCTGTACTGCTCTCTCATGGAGGGCAC 

TCCAGATGTCATGCTGTTCTCTAGGCCGGCATCCCTAAGCCTGCTCAGCAAACACCTGCTCA 

AGTCCTTTGTGTGTTCGACAAAGAACCGGCGCTGCAAACTGCTGCCCCTGGTGATGGCTGCC 

CCCCTGAGCATGGAGCATGGCACAGTGACCGTGGTGGGCATCCCCCCAGAQACCGACAGCTC 

GGACAGGAAGAACTTTTTTGGGAGGGCGTTTGAGAAGGCAGCGGAAAGCACCAGCTCCCGGA 

TGCrGCACAACCATTTTGACCTCTO^GTAATTGAGCTGAAAGCTGAGGATCGGAGCAAGTTT 

CTGGACGCACTTATTTCCCTCCTGTCCTAGGAATTTGATTCTTCCAGAATGACCTTCTTAT^ 

TATGTAACTGGCTTTCATTTAGATTGTAAGTTATGGACATGATTTGAGATGTAGAAGCCATT 

TTTTATTAAATAAAATGCTTATTTTAGGAAA 
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FIGURE 27 

MFVSDFRKEFYEWQSQRVLLFVASDVDALCACKILQALFQCDHVQYTLVPVSGWQELETAF 

LEHKEQFHYFILINCGANVDLLDI LQPDEDTI FFVCDSHRPVNWNVYNDTQIKLLIKQDDD 

LEVPAYEDIFRDEEEDEEHSGNDSDGSEPSEKRTRLEEEIVEQTMRRRQRREWEARRRDILF 

DYEQyEYHGTSSAMVMFEIAWMLSIa)LNDML^WAI.VGLTDQWVQDKITQ^IKYVTDVGVl^ 

VSRHiraiWEDEENTLSVDCTRI SFEYDIJlLVLYQHWSLHDSLan'SyTAARFKLWSVHGQKR 

LQEFLADMGLPLKQVKQKFQAMDISLKENLREMIEESANKFGMKDMRVQTFSIHFGFKHKFL 

ASDWFATMSLMESPEKDGSGTDHFIQALDSLSRSNLDKLYHGLELAKKQLRATQQTIASCL 

CTNLVISQGPFLYCSLMEGTPDVMLFSRPASLSLLSKHLLKSFVCSTKNRRCKLLPLVMAAP 

LSMEHGTVTWGIPPETDSSDRKNFFGRAFEKAAESTSSRMLHNHFDLSVIELKAEDRSKFL 
DALISLLS 
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FIGURE 28 

GTACCTCAGCGCGAGCGCCAGGCGTCCGGCCGCCGTGGCTATGNTCGTGTCCGATTTCCGCA 

AAGAGTTCTACGAGGTGGTCCAGAGCCAGAGGGTCCTTCTCTTCGTGGCCTCGGANGTGGAT 

GCTCTGTGTGCGTGCAAGATCCTTCAGGCCTTGTTCCAGTGTGACCANGTGCAATATANGCT 

GGTTCCAGTTTCTGGGTGGCAAGAACTT6AAACTGCATTTCTTGAGCATAAAGAACAGTTTC 

ATTATTTTATTCTCATAAACTGTGGAGCTAATGTAGACCTATTGGATATTCTTCAACCTGAT 

GAAGACACTATATTCTTTGTGTGTGACACCCATAGGCCAGTCAATGTTGTCAATGTATACAA 
CGATACCC 
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FIGURE 29 
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||ps??^^^is?^--— ^^^^ 

AACCTCTGCCTCCCGGGTTCAAGCGATTCTTCTGCCTCAGCCTC^^ 

aggcgcctgctaccatgcctggctaatttttgtStttagtIgSIt^^^ 
ttggccaggctggtcttgaactcctgacttcaggtScccacccgcc^^ 

2SSSS^r^^°°^°T^GC^CCGTGCCT5Gc6AA?T?^^ 

SS^Io^^S^^^^^^^^t^^atgttctctttaccccttccccctct^ 

^SS^SSH^^^^^^^^^C'^'TTCATTTATTCATTCATTAAACA^^^ 

SSI^^S°^°^^^^^agg<=ggtcagacacaggccctSS:cct 

?Ij;j2^S^AGGCGGGGAGAGATGTGTACATAGGTTTT^(SS^ 

GCCTGTGTTCTGGGTGTTCAGGTGCTGCTGGTCCTCCATOCC^^ 

^^52^CGGGGTCCCGGTGGCAGGGGCAGGTATCTC(SrC(^^^^ 

TGCTCATCGTTACAGCAAACCCCAGGGGGCCTTGGCCAGGTciiGGSTC^^^ 

J??S?S?2^^^^^^^^^^'^^G<^GGTGAAG?G^^^^ 

TAGCTCTCCCCACAGCTGATACGGCATCCTGCGAGAAGACCTGCCCTOT 

SIr5JS?^;^^'^^^^^°°^CT^GCCAGGGCC??5??^^ 

S^I^I^^CGTTTCCCCAGGGCCTCCAGCTGCCCTCAGACACTGATGTCTGT^ 

S^™^^^^^'^<^^^^ctctcaccggcccagtgccccgactctcSgg^^^ 
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FIGURE .^0 

MVTAALGPVWAALLLFLLMCEIRMVELTFDRAVASGCQRCCDSEDPLDPAHVSSASSSGRPH 

ALPEIRPYINITILKGDKGDPGPMGLPGYMGREGPQGEPGPQGSKGDKGEMGSPGAPCQKRF 

FAFSVGRKTALHSGEDFQTLLFERVFVNLDGCFDMATGQFAAPLRGIYFFSLNVHSWNYKET 

YVHIMHNQKEAVILYAQPSERSIMQSQSVMLDIAYGDRVWVRLFKRQRENAIYSNDFDTYIT 
FSGHLIKAEDD 



In^erfcant features : 
Signal peptide: 

amino acids 1-20 

N-glycosylation site, 
amino acids 72-75. 



Clq domain proteins. 

amino acids 144-178, 78-111 and 84-117 
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FIGURE .^1 

ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTG 

AGGCCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTG 

CTCCAGGGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGG 

GCTGCCCATCCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGG 

ACCACGGTGCCCCGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCAT 

CACCATGCTCGACGCAGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCAC 

AGAACCAGATCGCCAGCCTGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTG 

GACCTGACGGCCAACAGGCTGCATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCT 

CGAGCGCCTCTACCTGGGCAAGAACCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGC 

TCGACCGCCTCCTGGAGCTCAAGCTGCAGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGC 

CTGCCCCGCCTGCTGCTGCTGGACCTCAGCCACAACAGCCTCCTGGCCCTGGAGCCCGGCAT 

CCTGGACACTGCCAACGTGGAGGCGCTGCGGCTGGCTGGTCTGGGGCTGCAGCAGCTGGACG 

AGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACCTGGATGTGTCCGACAACCAGCTGGAG 

CGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACGCGCCTGCGGCTGGCCGGCAACAC 

CCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGCTGCCCTGCAGGAGCTGGATG 

TGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCCTCTTCCCCCGCCTGCGG 

CTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGCTGGTTTGGCCCCTG 

GGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCACTTCCCGCCCA 

AGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAGCCACCACC 

ACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCTTCTAG 

CTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTCCA 

CTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTC 

AATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTT 

CACGGGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCA 

CGCCGAGGCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGC 

GTGGGGCTGCAGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTA 

TCGCAACCTATCGGGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTG 

AGTACACGGTCACCCAGCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGG 

CCCGGGCGGGTGCCGGAGGGCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCA 

CTCCAACCACGCCCCAGTCACCCAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCG 

CCCTGGCCGCGGTGCTCCTGGCCGCGCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGG 

GGGCGGGCCATGGCAGCAGCGGCTCAGGACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCT 

GGAACTGGAGGGAGTGAAGGTCCCCTTGGAGCCAGGCCCGAAGGCAACAGAGGGCGGTGGAG 

AGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCCACTCATGGGCTTCCCAGGGCCTGGCCTC 

CAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCAGAGAGAGACAGGGCAGCTGGGGCCG 

GGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCCACACCACGTAAGTTCTCAGTCC 

CAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCTGGGCCCTGTTCCCTCTGGA 

CCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCCCTAACGTCCCCAGAAC 

CGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTCCCTGGGCACGGCG 

GGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCACTCCAGGCGGA 

CCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGCGGCTGTG 

TGACTCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGCTTTA 

GGAACATGTTTTGCTTTTTTAAAATATATATATTTATAAGAGATCCTTTCCCATTTATTCTG 

GGAAGATGTTTTTCAAACTCAGAGACAAGGACTTTGGTTTTTGTAAGACAAACGATGATATG 

AAGGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 



PMsnrmrv <viin nsBaoBaA: i > 
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PCT/US99/12252 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFEN 
GITMLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLR 
RLERLYLGKimiRHIQPGAFDTI^RLI^LKIXJDNELRALPPLRLPRLLLLDLSHNSLLALEP 
GILDTANVEALRLAGLGLQQUJEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRIAG 
NTRIAQLRPEDLAGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFG 
PWWESHVTLASPEETRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPVVREPTALS 
SSLAPTWLSPTAPATEAPSPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPE 
GFTGLYCESQMGQGTRPSPTPVTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRL 
TYRNLSGPDKRLVTLRLPASLAEYTVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPA 
VHSNHAPVTQAREGNLPLLIAPALAAVLLAALAAVGAAYCVRRGRAMAAAAQDKGQVGPGAG 
PLELEGVKVPLEPGPKATEGGGEALPSGSECEVPLMGFPGPGLQSPLHAKPYI 
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AAAC 



OTGGGGCAAAAATACAAAAATCAGATTTCCAGAAAATTGCI^ATCT^ 

atgcaacgaaatcttagtttagaaaatgctaagacatcgotct^S 

mACTCTGGGACGACCTTTTCCTTATCTTACAATTTGTTOcS^^ 



iG^^c^^^~^^ 

S^^SJ^G^^g^ASG^GlS?^^^^ 

S^taaaagacgttcatctccacgaattatcttgcaacaSc^^ 
tggttattatgctagttctggggttggctgtggccttctgctgtctcSct™ 

AGAACAACTCAAGAGAAATGTCCGATTCCACGCATTTATTTCATACAGTSACATrA^T^^ 

JS^S^S^^'^^^^^^^'^ccccaatctagagaaggS^sa^^ 

?Iir?Jn^?^'^^^'^^°^^^^T°°^AAAAGCATTAGTGAASTA^^^ 

^^^^^^^^^'^"^^^atctttgttttgtctcccaactttctccagaatga^^^ 

S^GAATTCTACTTTGCCCACCACAATCTCTTCCATGAAAATTCTG^ 
^J^^r^^^^^^^'^'^^^ATTCTATTGCATTCCCACCA^ATCAT^^ 
^^^^Sl^^GCATACTTGGAATGGCCCAAGGATAGGCGTAi^^^ 
ACCTTCGAGCTGCTATTAATGTTAATGTATTAGCCACCAG^AATOT^ 

ttcacagagttaaatgaagagtctcgaggttctSSatctc?^a?^^^ 

ataaaatcccacagtccttgggaagttggggaccacatacactStSg^^ 

S^^S™tgatggcaatttgacaatatttattSt^^ 

^^^SriI^^^^^°^^"^TT^TAAGAATGTATCCTATAGAAACACCTT^^ 

^^™1^°^^^^^ggtgttcatcccaggattgtttata^atgSaS^^ 
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FIGURE 

MRLIRNIYIFCSIVMTAEGDAPELPEERELMTNCSNMSLRKVPADLTPATTTLDLSYNLLFQ 

I^SSDFHSVSKLRVLILCHNRIQQLDLKTFEFNKELRYIJDLSNNRLKSVTWYL^GLRYLDL 

SFNDFDTMPICEEAGNMSHLEILGLSGAKIQKSDFQKIAHLHLNTVFLGFRTLPHYEEGSLP 

IIOTTKIJIIVLPMDTNFWVLLRDGIKTSKILEMTNIDGKSQPVSyEMORNLSLENAKTSVLL 

IJJKVDLLWDDLFLILQFVWHTSVEHFQIRNVTFGGKAYLDHNSFDYSNTVMRTIKLEHVH^ 

VFYIQQDKIYLLLTKMDIENLTISNAQMPHMLFPNYPTKFQYLNFANNILTDELFKRTIQLP 

HLKTLILNGNKLETLSLVSCFANNTPLEHLDLSQNLLQHKNDENCSWPETWNMNLSYNKLS 

DSVFRCLPKSIQILDLNNNQIQTVPKETIHLMALRELNIAFNFLTDLPGCSHFSRLSVLNIE 

MNFILSPSLDFVQSCQEVKTLNAGRNPFRCTCELKNFIQLETYSEVMMVGWSDSYTCEYPLN 

LRGTRLKDVHLHELSCNTALLIVTIVVIMLVLGLAVAFCCLHFDLPWYLRMLGQCTQTWHRV 

RKTTQEQLKRNVRFHAFISYSEHDSLWVKNELIPNLEKEDGSILICLYESYFDPGKSISENI 

VSFIEKSYKSIFVLSPNFVQNEWCHYEFYFAHHNLFHENSDHIILILLEPIPFYCIPTRYHK 

LKALLEKKAYLEWPKDRRKCGLFWANLRAAINVNVLATREMYELQTFTELNEESRGSTISLM 
RTDCL 
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FIGURE .lj>A 

GGGGGCTTTCTTGGGCTTGGCTGCTTGGAACACCTGCCTCCAAGGACCGGCCTCGGAGGGGT 

CGCCGGGAAAGGGAGGGAAGAAGGAAGGGCGGGGCCGGCCCCCCTGCGCCCGCCCCGCGCCT 

CTGCGCGCCCCTGTCCGCCCCGGCCCAGCCCAGCCCAGCCCCGCGGGCCGGTCACACGCGCA 

GCCAGCCGGCCGCCTCCCGCGCCCAAGCGCGCCGCTCTGCTGTGCCCTGCGCCCTTGCCCCG 

CGCCAGCTTCTGCGCCCGCAGCCCGCCCGGCGCCCCCGGTGACCGTGACCCTGCCCTGGGCG 

CGGGGCGGAGCAGGCATGTCCCGCCCGGGGACCGCTACCCCAGCGCTGGCCCTGGTGCTCCT 

GGCAGTGACCCTGGCCGGGGTCGGAGCCCAGGGCGCAGCCCTCGAGGACCCTGATTATTACG 

GGCAGGAGATCTGGAGCCGGGAGCCCTACTACGCGCGCCCGGAGCCCGAGCTCGAGACCTTC 

TCTCCGCCGCTGCCTGCGGGGCCCGGGGAGGAGTGGGAGCGGCGCCCGCAGGAGCCCAGGCC 

GCCCAAGAGGGCCACCAAGCCCAAGAAAGCTCCCAAGAGGGAGAAGTCGGCTCCGGAGCCGC 

CTCCACCAGGTAAACACAGCAACAAAAAAGTTATGAGAACCAAGAGCTCTGAGAAGGCTGCC 

AACGATGATCACAGTGTCCGTGTGGCCCGTGAAGATGTCAGAGAGAGTTGCCCACCTCTTGG 

TCTGGAAACCTTAAAAATCACAGACTTCCAGCTCCATGCCTCCACGGTGAAGCGCTATGGCC 

TGGGGGCACATCGAGGGAGACTCAACATCCAGGCGGGCATTAATGAAAATGATTTTTATGAC 

GGAGCGTGGTGCGCGGGAAGAAATGACCTCCAGCAGTGGATTGAAGTGGATGCTCGGCGCCT 

CATCCTATAAGGTCATGGTGAGCAATGACAGCCACACGTGGGTCACTGTTAAGAATGGATCT 
GGAGACATGATATTTGAGGGAAACAGTGAGAAGGAGATCCCTGTTCTCAATGAGCTACCCGT 
CCCCATGGTGGCCCGCTACATCCGCATAAACCCTCAGTCCTGGTTTGATAATGGGAGCATCT 
GCATGAGAATGGAGATCCTGGGCTGCCCACTGCCAGATCCTAATAATTATTATCACCGCCGG 
AACGAGATGACCACCACTGATGACCTGGATTTTAAGCACCACAATTATAAGGAAATGCGCCA 

GCCACCAGGGCCTGAAGCTGTATGCTGTGGAGATCTCAGATCACCCTGGGGAGCATGAAGTC 
GGTGAGCCCGAGTTCCACTACATCGCGGGGGCCCACGGCAATGAGGTGCTGGGCCGGGAGCT 
GCTGCTGCTGCTGGTGCAGTTCGTGTGTCAGGAGTACTTGGCCCGGAATGCGCGCATCGTCC 
^.^^■^''''''''''''''''''''''^''''''''^''''^''^"^^^^^^^^TCAACCCC^^^ 

GCCTACGAAGGGGGCTCGGAGCTGGGAGGCTGGTCCCTGGGACGCTGGACCCACGATGGAAT 
I^^^^^J^^'^'^^^^^^^^^T^'^TAAACACGCTGCTCTGG^^ 

ATGTCCCCAGGAAAGTTCCCAATCACTATATTGCAATCCCTGAGTGGTTTCTGTCGGAAAAT 
GCCACGGTGGCTGCCGAGACCAGAGCAGTCATAGCCTGGATGGAAAAAATCCCTTTTGTGCT 
GGGCGGCAACCTGCAGGGCGGCGAGCTGGTGGTGGCGTATCCCTACGACCTGGTGCGGTCCC 
CCTGGAAGACGCAGGAACACACCCCCACCCCCGATGACCACGTGTTCCGCTGGCTGGCCTAC 

CTTCCAGAAGGAGGAGGGCACTGTCAATGGGGCCTCCTGGCACACCGTCGCTGGAAGTCTGA 
ACGATTTCAGCTACCTTCATACAAACTGCTTCGAACTGTCCATCTACGTGGGCTGTGATAAA 
TACCCACATGAGAGCCAGCTGCCCGAGGAGTGGGAGAATAACCGGGAATCTCTGATCGTGTT 
CATGGAGCAGGTTCATCGTGGCATTAAAGGCTTGGTGAGAGATTCACATGGAAAAGGAATCC 
CAAACGCCATTATCTCCGTAGAAGGCATTAACCATGACATCCGAACAGCCAACGATGGGGAT 
TACTGGCGCCTCCTGAACCCTGGAGAGTATGTGGTCACAGCAAAGGCCGAAGGTTTCACTGC 
ATCCACCAAGAACTGTATGGTTGGCTATGACATGGGGGCCACAAGGTGTGACTTCACACTTA 
GCAAAACCAACATGGCCAGGATCCGAGAGATCATGGAGAAGTTTGGGAAGCAGCCCGTCAGC 
CTGCCAGCCAGGCGGCTGAAGCTGCGGGGGCGGAAGAGACGACAGCGTGGGTGACCCTCCTG 
GGrcCTTGAGACTCGTCTGGGACCCATGCAAATTAAACCAACCTGGTAGTA^CCATAGTG 
GACTCACTCACTGTTGTTTCCTCTGTAATTCAAGAAGTGCCTGGAAGAGAGO^^^^ 

AGGCAGGTCCCAAAAGGGAAGGCTGGAGGCTGAGGCTGTTTTCTTTTCTTTGTTCCCATTTA 
I^2^^'^'^'^^^°°''^'''^''°^'^^''°''°^°^^^A'rGGGAGTGAGAGA^^^ 

AACCTGGGAATCAGAGAGAGAAGGAGAAGGAGGGGAGCCTGTCCGTTCAGAGCCTCTGGCTGC 



BNSDOCtD: <WO_99S30e8A2J_> 
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FIGURE IfsR 

AATTTGCACAGCTAAAATTGCAGCATTTCCCCAGCTGGGCTGTCCCAAATG^^^^ 

gatgctcccaggcgtcctaagagaatccaccctctctggccctS 

gatcattcaggagtttgttgggcagcaagcatggagcttcttgcacaaattct^^ 
aacaacccccaaagtccctgctgatccagtagccctggaggttccc^ 
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FIGURE 36 
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MSRPGTATPALALVLLAVTLAGVGAQGAALEDPDYYGQEIWSREPYYARPEPELETFSPPLP 

AGPGEEWERRPQEPRPPKRATKPKKAPKREKSAPEPPPPGKHSNKKVMRTKSSEKAANDDHS 

VRVAREDVRESCPPLGLETLKITDFQLHASTVKRYGLGAHRGRLNIQAGINENDFYDGAWCA 

GRNDLQQWIEVDARRLTRFTGVITQGRNSLWLSDWVTSYKVMVSNDSHTWVTVKNGSGDMIF 

EGNSEKEIPVLNELPVPMVARYIRINPQSWFDNGSICMRMEILGCPLPDPNNYYHRRNEMTT 

TDDLDFKHHNYKEMRQLMKWNEMCPNITRIYNIGKSHQGLKLYAVEISDHPGEHEVGEPEF 

HYIAGAHGNEVLGRELLLLLVQFVCQEYLARNARIVHLVEETRIHVLPSLNPDGYEKAYEGG 

SELGGWSLGRWTHDGIDINNNFPDLNTLLWEAEDRQNVPRKVPNHYIAIPEWFLSENATVAA 

ETRAVIAWMEKIPFVLGGNLQGGELWAYPYDLVRSPWKTQEHTPTPDDHVFRWLAYSYAST 

HRLMTDARRRVCHTEDFQKEEGTVNGASWHTVAGSLNDFSYLHTNCFELSIYVGCDKYPHES 

QLPEEWENNRESLIVFMEQVHRGIKGLVRDSHGKGI PNAI ISVEGINHDIRTANDGDYWRLL 

NPGEYWTAKAEGFTASTKNCMVGYDMGATRCDFTLSKTNMARIREIMEKFGKQPVSLPARR 
LKLRGRKRRQRG 
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FIGURR 37 

CTAAGAGGACAAGATGAGGCCCGGCCTCTCATTTCTCCTAGCCCTTCTGTTCTTCCTTGGCC 

AAGCTGCAGGGGATTTGGGGGATGTGGGACCTCCAATTCCCAGCCCCGGCTTCAGCTCTTTC 

CCAGGTGTTGACTCCAGCTCCAGCTTCAGCTCCAGCTCCAGGTCGGGCTCCAGCTCCAGCCG 

CAGCTTAGGCAGCGGAGGTTCTGTGTCCCAGTTGTTTTCCAATTTCACCGGCTCCGTGGATG 

ACCGTGGGACCTGCCAGTGCTCTGTTTCCCTGCCAGACACCACCTTTCCCGTGGACAGAGTG 

GAACGCTTGGAATTCACAGCTCATGTTCTTTCTCAGAAGTTTGAGAAAGAACTTTCTAAAGT 

GAGGGAATATGTCCAATTAATTAGTGTGTATGAAAAGAAACTGTTAAACCTAACTGTCCGAA 

TTGACATCATGGAGAAGGATACCATTTCTTACACTGAACTGGACTTCGAGCTGATCAAGGTA 

GAAGTGAAGGAGATGGAAAAACTGGTCATACAGCTGAAGGAGAGTTTTGGTGGAAGCTCAGA 

AATTGTtGACCAGCTGGAGGTGGAGATAAGAAATATGACTCTCTTGGTAGAGAAGCTTGAGA 

CACTAGACAAAAACAATGTCCTTGCCATTCGCCGAGAAATCGTGGCTCTGAAGACCAAGCTG 

AAAGAGTGTGAGGCCTCTAAAGATCAAAACACCCCTGTCGTCCACCCTCCTCCCACTCCAGG 

GAGCTGTGGTCATGGTGGTGTGGTGAACATCAGCAAACCGTCTGTGGTTCAGCTCAACTGGA 

GAGGGTTTTCTTATCTATATGGTGCTTGGGGTAGGGATTACTCTCCCCAGCATCCAAACAAA 

GGACTGTATTGGGTGGCGCCATTGAATACAGATGGGAGACTGTTGGAGTATTATAGACTGTA 

CAACACACTGGATGATTTGCTATTGTATATAAATGCTCGAGAGTTGCGGATCACCTATGGCC 

AAGGTAGTGGTACAGCAGTTTACAACAACAACATGTACGTCAACATGTACAACACCGGGAAT 

ATTGCCAGAGTTAACCTGACCACCAACACGATTGCTGTGACTCAAACTCTCCCTAATGCTGC 

CTATAATAACCGCTTTTCATATGCTAATGTTGCTTGGCAAGATATTGACTTTGCTGTGGATG 

AGAATGGATTGTGGGTTATTTATTCAACTGAAGCCAGCACTGGTAACATGGTGATTAGTAAA 

CTCAATGACACCACACTTCAGGTGCTAAACACTTGGTATACCAAGCAGTATAAACCATCTGC 

TTCTAACGCCTTCATGGTATGTGGGGTTCTGTATGCCACCCGTACTATGAACACCAGAACAG 

AAGAGATTTTTTACTATTATGACACAAACACAGGGAAAGAGGGCAAACTAGACATTGTAATG 

CATAAGATGCAGGAAAAAGTGCAGAGCATTAACTATAACCCTTTTGACCAGAAACTTTATGT 

CTATAACGATGGTTACCTTCTGAATTATGATCTTTCTGTCTTGCAGAAGCCCCAG3EAAGCTG 

TTTAGGAGTTAGGGTGAAAGAGAAAATGTTTGTTGAAAAAATAGTCTTCTCCACTTACTTAG 

ATATCTGCAGGGGTGTCTAAAAGTGTGTTCATTTTGCAGCAATGTTTAGGTGCATAGTTCTA 

CCACACTAGAGATCTAGGACATTTGTCTTGATTTGGTGAGTTCTCTTGGGAATCATCTGCCT 

CTTCAGGCGCATTTTGCAATAAAGTCTGTCTAGGGTGGGATTGTCAGAGGTCTAGGGGCACT 

GTGGGCCTAGTGAAGCCTACTGTGAGGAGGCTTCACTAGAAGCCTTAAATTAGGAATTAAGG 

AACTTAAAACTCAGTATGGCGTCTAGGGATTCTTTGTACAGGAAATATTGCCCAATGACTAG 

TCCTCATCCATGTAGCACCACTAATTCTTCCATGCCTGGAAGAAACCTGGGGACTTAGTTAG 

GTAGATTAATATCTGGAGCTCCTCGAGGGACCAAATCTCCAACTTTTTTTTCCCCTCACTAG 

CACCTGGAATGATGCTTTGTATGTGGCAGATAAGTAAATTTGGCATGCTTATATATTCTACA 

TCTGTAAAGTGCTGAGTTTTATGGAGAGAGGCCTTTTTATGCATTAAATTGTACATGGCAAA 

TAAATCCCAGAAGGATCTGTAGATGAGGCACCTGCTTTTTCTTTTCTCTCATTGTCCACCTT 

ACTAAAAGTCAGTAGAATCTTCTACCTCATAACTTCCTTCCAAAGGCAGCTCAGAAGATTAG 

AACCAGACTTACTAACCAATTCCACCCCCCACCAACCCCCTTCTACTGCCTACTTTAAAAAA 

ATTAATAGTTTTCTATGGAACTGATCTAAGATTAGAAAAATTAATTTTCTTTAATTTCATTA 

TGGACTTTTATTTACATGACTCTAAGACTATAAGAAAATCTGATGGCAGTGACAAAGTGCTA 

GCATTTATTGTTATCTAATAAAGACCTTGGAGCATATGTGCAACTTATGAGTGTATCAGTTG 

TTGCATGTAATTTTTGCCTTTGTTTAAGCCTGGAACTTGTAAGAAAATGAAAATTTAATTTT 

TTTTTCTAGGACGAGCTATAGAAAAGCTATTGAGAGTATCTAGTTAATCAGTGCAGTAGTTG 

GAAACCTTGCTGGTGTATGTGATGTGCTTCTGTGCTTTTGAATGACTTTATCATCTAGTCTT 

TGTCTATTTTTCCTTTGATGTTCAAGTCCTAGTCTATAGGATTGGCAGTTTAAATGCTTTAC 

TCCCCCTTTTAAAATAAATGATTAAAATGTCCTTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MRPGLSFLLALLFFLGQAAGDLGDVGPPIPSPGFSSFPGVDSSSSFSSSSRSGSSSSRSLGS 

GGSVSQLFSNFTGSVDDRGTCQCSVSLPDTTFPVDRVERLEFTAHVLSQKFEKELSKVREYV 

QLISVYEKKLLNLTVRIDIMEKDTISYTELDFELIKVEVKEMEKLVIQLKESFGGSSEIVDQ 

LEVEIRNMTLLVEKLETLDKNNVLAIRREIVALKTKLKECEASKDQNTPWHPPPTPGSCGH 

GGVVNISKPSWQIJaWRGFSYLYGAWGRDySPQHPNKGLYWAPLNTOGRLLEYYRLYNTLD 

DLLLYINARELRITYGQGSGTAVYNNNMYVNMYNTGNIARVNLTTNTIAVTQTLPNAAYNNR 

FSYANVAWQDIDPAWENGLWIYSTEASTGNMVISKLNDTTLQVLNTWYTKQYKPSASNAF 

MVCGVLYATRTMNTRTEEIFYYYDTNTGKEGKLDIVMHKMQEKVQSINYNPFDQKLYVYNDG 
YLLNYDLSVLQKPQ 
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FIGURE 

GCTCTGAAGACCAAGCTGAAAGAGTGTGAGGCCTCTAAAGATCAAACACCCCI^TCGTC^^^ 
CCTCCTCCCACTCCAGGGAGCTGTGGTCATGGTGGTGTGGTGAACATCAGCAAAC^^^ 

ggttcagctcaactggagagggitttcttatctatatggtgcttggggtaggga™^^ 

CCCAGCATCCAAACAAAGGNATGTATTGGGNGGCGCCAITGAATACAGATGGGAGACT^^^ 

GAGTATTATAGACTGTACAACCCACTGGATGATTTGCTATTGTATATAAATGCTC^^^^^ 

GCGGATCACCTATGGCCAAGGTAGTGGTACAGCAGTTTACAACAACAACATGTA^^^ 

tgtacaacaccgggnatattgccagagttaacctgacc 
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FIGURE 40 

TCTCGCAGATAGTAAATAATCTCGGAAAGGCGAGAAAGAAGCTGTCTCCATCTTGTCTGTAT 

^^^^oSTSF^'^^^^^'^^TG^AG^GGGAGCGTCCTGGGGCTGTGCTCCATGGCGAGCT 
??5I^^5^^TTTGTGTGGAAGTGCCCCGTGTTTGCTATGCCGATGCT^^ 

IS^^^J™£^^^^^^^^°^^^^^<=^TGAATAAGATTCCTGGATTTTGTGAGAATG 
AGAAAGGTGTTGTCCCTTGTAACATTTTGGTTGGCTATAAAGCTGTATATCGTTTGTr 

I^n^oSHT^S^^^^'^^'^G^TTATOTGCTGTCTTTAGTTGCTATCGTCCTGT^ 

TACACTCATCCAGCCAGTTGTTCAGAAAACAAGGCGTTCATCAGTGTciAciTGC^^^ 

CGTTGGTGCTTCTGTAATGTCTATACTGCCAAAAATCCAAGAAT(^cSSg^^^ 

GAACCAGAAACAAATTGCAACCOVAGTCTACTAAGCATAATTGGCTACAATAC^ 

2:^JSS^°^^°^^^<^t^agtccagtggtggcatgctcaag§aa^ 
ISTttttgttgtgtgtattttattccagcatccgtacttcaaacaat^ 
^^^S^^T^taacaagtgatgaatctacattaatagaagatggtggagctagSgtSto^ 

?SJ?S^^^^J^°°°^^°^^^^^^^CGAGCTGTAGATAATGAAAGGGAT^^ 

gttattccttctttcacttcatgcttttcctggcttcactttatatcatSt^ 
aactggtccaggtatgaaccctctcgtgagatgaaaagtcagtggaSot^^ 
aatctcttccagttggattggcatcgtgctgtatgtttggaS??5S^ 

StS^T^?'^^^^^^^°^^^°'^GAGACTTCTAGCATGAAAGTCCCACTTTGATTA^^^ 

IZ^^I^?^^c:agtattcccaacttttgtaaagttgtgtatgtttt'^^ 

gtgcattgatatgtgaagtagaatgaattgcagaggaaagttttatgaatatggtc^ 
tagtaaaagtggccattattgggcttattctctgctctatagttgtgaaatgSgag?^ 
JS^^JJ^^^^^^^'^'^^t^ttatattagaccSSg??^^^ 
JS^tgtatggctgccttttgaaatatttgatgtgttgcctggcaggatactc 

aggttttacccttgatacggaatttacacaggtagggagtgtttagtggac^^^taSt?? 

^ggatggaggtgtcggtactaaattgaataacgagtaaatStct^ 

cctttgccaacaaagtgaactgttttggttgttttaaact^tgaagta^ 

AATGTTTGGAACTCTGAAGGATTTAGACAAGGTTTTcSuiG^^^ 
AAGTGTTTTGAAAGTCACTTTGAAAGTTAGTTTTGGG(^GCA^^^ 

??^SS^S^''^'^^^°^°^°^^t^gtgggtagattacttgagSc^^^ 

tggcacatggtgaacctgttctataaaaataatctggctttgag^tatgcctgtcg^ 

SJS^:^^^^S°^^^^^^^^^^^°ctgagcccagagccaaIggt?^^^ 

cactgcactctagctggcacagagtaagccaaaaaaatatatatatattgaa^ 

caaaattttgacagggaaggaagtaactgcaaaaccactS^^ 

aaatctagtccagttctctcatttaaaaaaatgaagacactgStSa^^^ 

S??JJJ?SI^r''^°^"^^^TTGGCCAATAATAGCATTCTCT^^ 

J5^J™attcaaaatacatgcatattgatttacacctcatactgtgataatt^^ 

ataaactcctgcttatagtatactacacagttcaaaagatgtttaaaatgcttttgtat^^ 
ctgccatgtaattgaaatatatagattattgtaaotttcS^^ 

A^TTTAGTTATTTGTATGTGTCACTAGTGTCTAATGAkGCTS^^^ 

JSJ^^'^^'^^^^^^^^^aatggaatataacaattSt'^ 
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FIGURE 41 

MGSVLGLCSI^WIPCLCGSAPCLLCRCCPSGNNSTWRLIYALFLLVGVCVACVMLIPGME 

EQLNKIPGFCENEKGWPCNILVGYKAVYRLCFGU^FYLLLSLI^IKVKSSSDPRAAVHNG 

FWFFKFAAAIAIIIGAFFIPEGTFTTVWFYVGMAGAFCFILIQLVLLIDFAHSWNESWVEKM 

EEGNSRCWYAALLSATALNYLLSLVAIVLFFVYYTHPASCSENKAFISVNMLLCVGASVMSI 

LPKIQESQPRSGLLQSSVITVYTMYLTWSAMTNEPETNCNPSLLSIIGYNTTSTVPKEGQSV 

QWWHAQGIIGLILFLLCVFYSSIRTSNNSQVNKLTLTSDESTLIEDGGARSDGSLEDGDDVH 

RAVDNERDGVTYSYSFFHFMLFIASLYIMMTLTNWSRYEPSREMKSQWTAVWVKISSSWIGI 
VLYVWTLVAPLVLTNRDFD 
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GCGAGAAAGAAGCTGTCTCCATCTTGTCTGTATCCCGCTGCTTCTTGNGACGTTGTGGAGAT 
GGGGAGCGTCCCTGGGGCTGTGCTCCATGGCGAGCTGGATACCATGTTTGTGTGGAAGTGCC 
CCGTGTTTGCTATGCCGATGCTGTCCTAGTGGAAACAANTCCACTGTAACTAGATTGATCTA 
TGCACTTTTCTTGCTTGTTGGAGTATGTGTAGCTTGTGTAATGTTGATACCAGGAATGGAAG 
AACAACTGAATAAGATTCCTGGATTTTGTGAGAATGAGAAAGGTGTTGTCCCTTGTAACATT 
TTGGTTGGCTATAAAGCTGTATATCGTTTGTGCTTTGGTTTGGCTATGTTCTATCTTCTTCT 
CTCTTTACTAATGATCAAAGTGAAGAGTAGCAGTGATCCTAGAGCTGCAGTGCACAATGGAT 
TTTGGTTCTTTAAATTTGCTGCAGCAATTGCAATTATTATTGGGGC 
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FIGURE 

GTTATTGTGAACTTTGTGGAGATGGQAGGTOmsaSGCTGTGTTCCATGGCGAGCT^ 

CANGTTTGTGTGGAAGTGCCCCGTGTTTGNTATGCCGATGCTGTCCTAGTGGAAACAAN^^ 

ACTGTAATTAGATTGATNTATGCACTTTTNTTGCTTGTTGGAGTANGTGTAGCTTGTGT^^ 

GTTGATACCAGGAATGGAAGAACAACTGAATAAGATTCCTGGATTTTGTGAGAATGAGAA^^ 

GTGTTGTCCCTTGTAACATTTTGGTTGGCTATAAAGCTGTATATOGTTTGTGCTTTG^^ 

GCTANGTTCTATNTTCTTCTCTCTTTACTAATGATCAAAGTGAAGAGTAGCAGTGATCCTAG 

AGCTGCAGTGCACAATGGATTTTGGTTTTTTAAATTTGCTGCAGCAATTGCAATTATTATTG 
GGGC 
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FIGURE 44 

AAGAAGCTGTCTCCATCTTGTCTGTATCCGCTGCTCTTGTGAACGTTNTGGAGATGGGGAGC 

GTCCTTGGGGTTGTGCTCCATGGCGAGCTGGATACCATGTTTGTGTGGAAGTGCCCCGTGTT 

TGCTATGCCGATGCTGTCCTAGTGGAAACAACTCCACTGTAACTAGATTGATCTATGCACTT 

TTCTTGCTTGTTGGAGTATGTGTAGCTTGTGTAATGTTGATACCAGGAATGGAAGAAC^^^ 

GAATAAGATTCCTGGATTTTGTGAGAATGAGAAAGGTGTTGTCCCTTGTAACATTTTGGTTG 

GCTATAAAGCTGTATATCGTTTGTGCTTTGGTTTGGCTATGTTCTATCTTCTTCTCTCTTTA 

CTAATGATCAAAGTGAAGAGTAGCAGTGATCCTAGAGCTGCAGTGCACAATGGATTTTGGTT 
CTTTAAATTTGCTGCAGCAATTGCAATTATTATTGGGGC 
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lATCTATGCACTTTTTCCTTG 
AACAACTGAATA 
TTTTTGGTTGGC 



GCTGTCCTTAGTGGAAACAANTCCAACTTGTAACTTGGATTG 

CTTGITGGAGTATGTGTAGCTTTGTGTAATGTTGTTCCCAGGATTGGANGAA^^^^ 
AGArrcCTGGATTTTTGTGAGAATGAGAAAGGTGTTGTCCCCTTGTAACA' 
TATAAAGCTGTATATCGTTTGTGCTTTGGTTTGGCTATGTTCTATCITCTTCTCT^^ 

aatgatcaaagtgaagagtagcagtgatcctagagctgcagtgcacaatggat™^^^ 

TTAAATITGCTGCAGCAATrGCAArrATTATTGGGGCATrCTTCATTCC^^^^ 
ACAACTGTGTGGTTTTATGTAGGCATGGCAGGTGCCTTTTGTTTCATCCT^^^^ 

GTTGAAAAAATGGAAGAAGGGA 
ACTCGAGATGTTGGTATGCAGCCTTGTTATCAGCTACAGCTCTGAATTATCT^^^^ 

GTTGCTATCGTCCTGTTCTTTGTCTACTACACTCATCCAGCCAGTTGTTCAGAAAAC^^^^ 
GTTCATCAGTGTCAACATGCTCCTCTGCGTTGGTGCTTCTGTAATG 



CTTACTTATTGATTTTGCACATTCATGGAATGAATCGTGG 
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FIGURE 46A 

CTCGGGCGCGCACAGGCAGCTCGGTTTGCCCTGCGATTGAGCTGCGGGTCGCGGCCGGCGCC 

GGCCTCTCCAATGGCAAATGTGTGTGGCTGGAGGCGAGCGCGAGGCTTTCGGCAAAGGCAGT 

CGAGTGTTTGCAGACCGGGGCGAGTCCTGTGAAAGCAGATAAAAGAAAACATTTATTAACGT 

GTCATTACGAGGGGAGCGCCCGGCCGGGGCTGTCGCACTCCCCGCGGAACATTTGGCTCCCT 

CCAGCTCCGAGAGAGGAGAAGAAGAAAGCGGAAAAGAGGCAGATTCACGTCGTTTCCAGCCA 

AGTGGACCTGATCGATGGCCCTCCTGAATTTATCACGATATTTGATTTATTAGCGATGCCCC 

CTGGTTTGTGTGTTACGCACACACACGTGCACACAAGGCTCTGGCTCGCTTCCCTCCCTCGT 

TTCCAGCTCCTGGGCGAATCCCACATCTGTTTCAACTCTCCGCCGAGGGCGAGCAGGAGCGA 

GAGTGTGTCGAATCTGCGAGTGAAGAGGGACGAGGGAAAAGAAACAAAGCCACAGACGCAAC 

TTGAGACTCCCGCATCCCAAAAGAAGCACCAGATCAGCAAAAAAAGAAGATGGGCCCCCCGA 

GCCTCGTGCTGTGCTTGCTGTCCGCAACTGTGTTCTCCCTGCTGGGTGGAAGCTCGGCCTTC 

CTGTCGCACCACCGCCTGAAAGGCAGGTTTCAGAGGGACCGCAGGAACATCCGCCCCAACAT 

CATCCTGGTGCTGACGGACGACCAGGATGTGGAGCTGGGTTCCATGCAGGTGATGAACAAGA 

CCCGGCGCATCATGGAGCAGGGCGGGGCGCACTTCATCAACGCCTTCGTGACCACACCCATG 

TGCTGCCCCTCACGCTCCTCCATCCTCACTGGCAAGTACGTCCACAACCACAACACCTACAC 

CAACAATGAGAACTGCTCCTCGCCCTCCTGGCAGGCACAGCACGAGAGCCGCACCTTTGCCG 

TGTACCTCAATAGCACTGGCTACCGGACAGCTTTCTTCGGGAAGTATCTTAATGAATACAAC 

GGCTCCTACGTGCCACCCGGCTGGAAGGAGTGGGTCGGACTCCTTAAAAACTCCCGCTTTTA 

TAACTACACGCTGTGTCGGAACGGGGTGAAAGAGAAGCACGGCTCCGACTACTCCAAGGATT 

ACCTCACAGACCTCATCACCAATGACAGCGTGAGCTTCTTCCGCACGTCCAAGAAGATGTAC 

CCGCACAGGCCAGTCCTCATGGTCATCAGCCATGCAGCCCCCCACGGCCCTGAGGATTCAGC 

CCCACAATATTCACGCCTCTTCCCAAACGCATCTCAGCACATCACGCCGAGCTACAACTACG 

CGCCCAACCCGGACAAACACTGGATCATGCGCTACACGGGGCCCATGAAGCCCATCCACATG 

GAATTCACCAACATGCTCCAGCGGAAGCGCTTGCAGACCCTCATGTCGGTGGACGACTCCAT 

GGAGACGATTTACAACATGCTGGTTGAGACGGGCGAGCTGGACAACACGTACATCGTATACA 

CCGCCGACCACGGTTACCACATCGGCCAGTTTGGCCTGGTGAAAGGGAAATCCATGCCATAT 

GAGTTTGACATCAGGGTCCCGTTCTACGTGAGGGGCCCCAACGTGGAAGCCGGCTGTCTGAA 

TCCCCACATCGTCCTCAACATTGACCTGGCCCCCACCATCCTGGACATTGCAGGCCTGGACA 

TACCTGCGGATATGGACGGGAAATCCATCCTCAAGCTGCTGGACACGGAGCGGCCGGTGAAT 

CGGTTTCACTTGAAAAAGAAGATGAGGGTCTGGCGGGACTCCTTCTTGGTGGAGAGAGGCAA 

GCTGCTACACAAGAGAGACAATGACAAGGTGGACGCCCAGGAGGAGAACTTTCTGCCCAAGT 

ACCAGCGTGTGAAGGACCTGTGTCAGCGTGCTGAGTACCAGACGGCGTGTGAGCAGCTGGGA 

CAGAAGTGGCAGTGTGTGGAGGACGCCACGGGGAAGCTGAAGCTGCATAAGTGCAAGGGCCC 

CATGCGGCTGGGCGGCAGCAGAGCCCTCTCCAACCTCGTGCCCAAGTACTACGGGCAGGGCA 

GCGAGGCCTGCACCTGTGACAGCGGGGACTACAAGCTCAGCCTGGCCGGACGCCGGAAAAAA 

CTCTTCAAGAAGAAGTACAAGGCCAGCTATGTCCGCAGTCGCTCCATCCGCTCAGTGGCCAT 

CGAGGTGGACGGCAGGGTGTACCACGTAGGCCTGGGTGATGCCGCCCAGCCCCGAAACCTCA 

CCAAGCGGCACTGGCCAGGGGCCCCTGAGGACCAAGATGACAAGGATGGTGGGGACTTCAGT 

GGCACTGGAGGCCTTCCCGACTACTCAGCCGCCAACCCCATTAAAGTGACACATCGGTGCTA 

CATCCTAGAGAACGACACAGTCCAGTGTGACCTGGACCTGTACAAGTCCCTGCAGGCCTGGA 

AAGACCACAAGCTGCACATCGACCACGAGATTGAAACCCTGCAGAACAAAATTAAGAACCTG 

AGGGAAGTCCGAGGTCACCTGAAGAAAAAGCGGCCAGAAGAATGTGACTGTCACAAAATCAG 

CTACCACACCCAGCACAAAGGCCGCCTCAAGCACAGAGGCTCCAGTCTGCATCCTTTCAGGA 

AGGGCCTGCAAGAGAAGGACAAGGTGTGGCTGTTGCGGGAGCAGAAGCGCAAGAAGAAACTC 

CGCAAGCTGCTCAAGCGCCTGCAGAACAACGACACGTGCAGCATGCCAGGCCTCACGTGCTT 

CACCCACGACAACCAGCACTGGCAGACGGCGCCTTTCTGGACACTGGGGCCTTTCTGTGCCT 

GCACCAGCGCCAACAATAACACGTACTGGTGCATGAGGACCATCAATGAGACTCACAATTTC 
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FIGURE 4^R 

gctgatgaatgcagtgaacacactggacagggatgtcctcaaccagcSSSta^^^^ 

TGGAGCTGAGGAGCTGCAAGGGTTACAAGCAGTGTAACCCCCGGAC?^G^ 
GATGGAGGAAGCTATGAGCAATACAGGCAGTTTCAGCGTCGAA^G^Sc^^^ 

accttcttccaaatcactgggacaactgtgggaaggctgggaaggttaaS^ 

rS^SS!5^????^3^^°^°°^TCACCTGACTGCACAGGC^^ 



GCCCC 



^TGACAGATTCTGGAGGATAACCAGCAGGAGCAGAGATAACTTCAGGAAGTCCATm^ 
GCCCCTGCTTTTGCTTTGGATT^^^^ 



GATTTCCTTGGAAATTTCTCCCAAGGGCGAAAGTCATTGGAATTTTTAAATCATAGGGGAAA 

agcagtcctgttctaaatcctcttattcttttggtttgtcacSaga^I^a^^ 
g™ggcaacgtggagaggctgaaaacagtgcIgagS^^ 

t^5?f°''°^'^^^^'^™^^^A°^CTATAAACCCTGGTrGCCTCT^ 
AAATCCCAATTTTCAGGAGTGGTGGTGTCAATAAACGCTCTGTGGCCAGTGTAAAAGAAAAA 
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FIGURE 47 

MGPPSLVLCLLSATVFSLLGGSSAFLSHHRLKGRFQRDRRNIRPNIILVLTDDQDVELGSMQ 

VMNKTRRIMEQGGAHFINAFVTTPMCCPSRSSILTGKYVHNHNTYTNNENCSSPSWQAQHES 

RTFAVYLNSTGYRTAFFGKYLNEYNGSYVPPGWKEWVGLLKNSRFYNYTLCRNGVKEKHGSD 

YSKDYLTDLITODSVSFFRTSKKMYPHRPVLMVISHAAPHGPEDSAPQYSRLFPNASQHITP 

SYNYAPNPDKHWIMRYTGPMKPIHMEFTNMI^RKRLQTIJ1SVDDSMETIYimLVETGEIJ)NT 

YIVYTADHGYHIGQFGLVKGKSMPYEFDIRVPFYVRGPNVEAGCLNPHIVLNIDLAPTILDI 

AGU)IPADMDGKSILKLLDTERPVNRFHLKKKMRVWRDSFLVERGKIiHKRDNDKVD^^ 

FLPKYQRVKDLCQRAEYQTACEQLGQKWQCVEDATGKLKLHKCKGPMRLGGSRALSNLVPKY 

YGQGSEACTCDSGDYKLSLAGRRKKLFKKKYKASYVRSRSIRSVAIEVDGRVYHVGLGDAAQ 

PRNLTKRHWPGAPEDQDDKDGGDFSGTGGLPDYSAANPIKVTHRCYILENDTVQCDLDLYKS 

LQAWKDHKLHIDHEIETLQNKIKNLREVRGHLKKKRPEECDCHKISYHTQHKGRLKHRGSSL 

HPFRKGLQEKDKVWLLREQKRKKKLRKLLKRLQNNDTCSMPGLTCFTHDNQHWQTAPFWTLG 

PFCACTSANNNTYWCMRTINETHNFLFCEFATGFLEYFDLNTDPYQLMNAVNTLDRDVLNQL 

HVQLMELRSCKGYKQCNPRTRNMDLDGGSYEQYRQFQRRKWPEMKRPSSKSLGQLWEGWEG 
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FIGURE 4« 

AACAAAGTTCAGTGACTGAGAGGGCTGAGCGGAGGCTGCTGAAGGGGAGAAAGGAGTGAGGA 
GCTGCTGGGCAGAGAGGGACTGTCCGGCTCCCAGAT6CTGGGCCTCCTGGGGAGCACAGCCC 
TCGTGGGATGGATCACAGGTGCTGCTGTGGCGGTCCTGCTGCTGCTGCTGCTGCTGGCCACC 
TGCCTTTTCCACGGACGGCAGGACTGTGACGTGGAGAGGAACCGTACAGCTGCAGGGGGAAA 
CCGAGTCCGCCGGGCCCAGCCTTGGCCCTTCCGGCGGCGGGGCCACCTGGGAATCTTTCACC 
ATCACCGTCATCCTGGCCACGTATCTCATGTGCCGAATGTGGGCCTCCACCACCACCACCAC 
CCCCGCCACACCCCTCACCACCTCCACCACCACCACCACCCCCACCGCCACCATCCCC^^ 

CGCTCGCTGAGGCTGCTGTCGCCGGTGCCTGTGGACAGCAGCTGCCCCTGCCCTCCCATCTG 

TTCCCAGGACAAGTGGACCCCATGTTTCCATGTGGAAGGATGCATCTCTGGGGTGAACGAGG 

GGAACAATAGACTGGGGCTTGCTCCAGCTGCATTTGCATGGCATGCCCCAGTGTACTATGGC 

AGCAGAGAATGGAGGAACACTGGGTCTGCAGTGCTGAAGGGTTTGGGGAGTGGAGAGCAAGG 

GTGCTCTTTCGGGGCTGGACAGCCCGTCTTGTGACAGTGACTCCCAGTGAGCCCCAGAAATG 

ACAAGCGTGTCTTGGCAGAGCCAGCACACAAGTGGATGTGAAGTGCCCGTCTTGACCTCCTC 

ATCAGGCTGCTGCAGGCCTCTGGCGGGCAGGGCACTGGGAGAGGCCCTGAGAATGTCCTTTT 

GGTTTGGAGAAGGCAGTGTGAGGCTGCACAGTCAATTCATCGGTGCCTTAGTCCAAGAAAAT 
AAAAACCACTAAGAAGCTTTAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 49 



MLGLLGSTALVGWITGAAVAVLLLLLLLATCLFHGRQDCDVERNRTAAGGNRVRRAQPWPFR 
RRGHLGI FHHHRHPGHVSHVPNVGLHHHHHPRHTPHHLHHHHHPHRHHPRHAR 
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FIGURE j>n 

GGCGGCTGCTGAGCTGCCTTGAGGTGCAGTGTTGGGGATCCAGAGCC^TCGGACCTGCTA 

CTACTGGGCCTGATTGGGGGCCTGACTCTCTTACTGCTGCTGACGCTGCTGGCCTTTGCCGG 

GTACTCAGGGCTACTGGCTGGGGTGGAAGTGAGTGCTGGGTCACCCCCCATCCGCAACGTCA 

CTGTGGCCTACAAGTTCCACATGGGGCTCTATGGTGAGACTGGGCGGCTTTTCACTGAGAGC 

TGCAGCATCTCTCCCAAGCTCCGCTCCATCGCTGTCTACTATGACAACCCCCACATO^^^^ 

CCCTGATAAGTGCCGATGTGCCGTGGGCAGCATCCTGAGTGAAGGTGAGGAATCGCCCTCCC 

CTGAGCTCATCGACCTCTACCAGAAATTTGGCTTCAAGGTGTTCTCCTTCCCGGCACCC^^ 

CATGTGGTGACAGCCACCTTCCCCTACACCACCATTCTGTCCATCTGGCTGGCTACCC^^^^ 

TGTCCATCCTGCCTTGGACACCTACATCAAGGAGCGGAAGCTGTGTGCCTATCCTCGGCTGG 

AGATCTACCAGGAAGACCAGATCCATTTCATGTGCCCACTGGCACGGCAGGGAGACTTCTAT 

GTGCCTGAGATGAAGGAGACAGAGTGGAAATGGCGGGGGCTTGTGGAGGCCATTGACACCCA 

GGTGGATGGCACAGGAGCTGACACAATGAGTGACACGAGTTCTGTAAGCTTGGAAGTGAGCC 

CTGGCAGCCGGGAGACTTCAGCTGCCACACTGTCACCTGGGGCGAGCAGCCGTGGCTGGGAT 

GACGGTGACACCCGCAGCGAGCACAGCTACAGCGAGTCAGGTGCCAGCGGCTCCTCTTTTGA 

GGAGCTGGACTTGGAGGGCGAGGGGCCCTTAGGGGAGTCACGGCTGGACCCTGGGACTGAGC 

CCCTGGGGACTACCAAGTGGCTCTGGGAGCCCACTGCCCCTGAGAAGGGCAAGGAGTAACCC 

ATGGCCTGCACCCTCCTGCAGTGCAGTTGCTGAGGAACTGAGCAGACTCTCCAGCAGACTCT 

CCAGCCCTCTTCCTCCTTCCTCTGGGGGAGGAGGGGTTCCTGAGGGACCTGACTTCCCCTGC 

TCCAGGCCTCTTGCTAAGCCTTCTCCTCACTGCCCTTTAGGCTCCCAGGGCCAGAGGAGCCA 

GGGACTATTTTCTGCACCAGCCCCCAGGGCTGCCGCCCCTGTTGTGTCTTTTTTTCAGACTC 

ACAGTGGAGCTTCCAGGACCCAGAATAAAGCCAATGATTTACTTGTTTCACCTGGAAAAAAA 
AAAA/^AAAAA 
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FIGURE SI 

MSDLLLLGLIGGLTLLLLLTLLAFAGYSGLLAGVEVSAGSPPIRNVTVAYKFHMGLYGETGR 
LFTESCSISPKLRSIAVYYDNPHMVPPDKCRCAVGSILSEGEESPSPELIDLYQKFGFKVFS 
FPAPSHWTATFPYTTILSIWLATRRVHPALDTYIKERKLCAYPRLEIYQEDQIHFMCPLAR 
QGDFYVPEMKETEWKWRGLVEAIDTQVDGTGADTMSDTSSVSLEVSPGSRETSAATLSPGAS 
SRGWDDGDTRSEHSYSESGASGSSFEEU)I£GEGPI/3ESRIJ3PGTEPLGTTKWLWEPTAPEKGKE 
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FIGURE ^2 
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CCGCGGGAACGCTGTCCTGGCTGCCGCCACCCGAACAGCCTGTCCTGGTGCCCCGGCTCCCT 

GCCCCGCGCCCAGTC&TGACCCTGCGCCCCTCACTCCTCCCGCTCCATCTGCTGCTGCTGCT 

GCTGCTCAGTGCGGCGGTGTGCCGGGCTGAGGCTGGGCTCC3AAACCGAAAGTCCCGTCCGGA 

CCCTCCAAGTGGAGACCCTGGTGGAGCCCCCAGAACCATGTGCCGAGCCCGCTGCTTTTGGA 

GACACGCTTCACATACACTACACGGGAAGCTTGGTAGATGGACGTATTATTGACACCTCCCT 

GACCAGAGACCCTCTGGTTATAGAACTTGGCCAAAAGCAGGTGATTCCAGGTCTGGAGCAGA 

GTCTTCTCGACATGTGTGTGGGAGAGAAGCGAAGGGCAATCATTCCTTCTCACTTGGCCTAT 

GGAAAACGGGGATTTCCACCATCTGTCCCAGCGGATGCAGTGGTGCAGTATGACGTGGAGCT 

GATTGCACTAATCCGAGCCAACTACTGGCTAAAGCTGGTGAAGGGCATTTTGCCTCTGGTAG 

GGATGGCCATGGTGCCAGCCCTCCTGGGCCTCATTGGGTATCACCTATACAGAAAGGCCAAT 

AGACCCAAAGTCTCCAAAAAGAAGCTCAAGGAAGAGAAACGAAACAAGAGCAAAAAGAAAIA 

ATAAATAATAAATTTTAAAAAACTTAAAAAAAAAAAAAAAAAA 
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FIGURE ^1 

MTLRPSLLPLHLLLLLLLSAAVCRAEAGLETESPVRTLQVETLVEPPEPCAEPAAFGDTLHI 

HYTGSLVDGRIIDTSLTRDPLVIELGQKQVIPGLEQSLLDMCVGEKRRAIIPSHLAYGKRGF 

PPSVPADAWQYDVELIALIRANYWLKLVKGILPLVGMAMVPALLGLIGYHLYRKANRPKVS 
KKKLKEEKRNKSKKK 
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CCCGGGAACGTGTTCCTGGCTGCCGCACCCGAACAGCCTGTCCTGGTGCCCCGGCTCCCTGC 

CCCGCGCCa^GTCATGACCCTGCGCCCCT(mCTCCTCCCGCTCCATCTGCTGCTGCTGCTC^ 

TGCTCAGTGCGGCGGTGTGCCGGGCTGAGGCTGGGCTCGAAACCGAAAGTCCCGTCCGGACC 

CTCCAAGTGGAGACCCTGGTGGAGCCCCCAGAACCT^TGTGCCGAGCCCGCTGCTTTTGGAGA 

CACGCTTCACATACACTACACGGGAAGCTTGGTAGATGGACGTATTATTGACACCTCCCTGA 

CCAGAGACCCTCTGGTTATAGAACTTGGCCAAAAGCAGGTGATTCCAGGTCTGGAGCAGAGT 

CTTCTCGACATGTGTGTGGGAGAGAAGCGAAGGGCAATCATTCCTTCTCACTTGGCCTATGG 

AAAACGGGGATTTCCACCATCTGTCCCAGCGGATGCAGTGGTGCAGTATGACGTGGAGCTGA 

TTGCACTAATCCGAGCCAACTACTGGCTAAAGCTGGTGAAGGGCATTTTGCCTCTGGTAGGG 

ATGGCCATGGTGCCACCCTCCTGGGCCTCATTGGGTATCACCTATACAGAAAGGCCAATAGA 

CCCAAAGTCTCCAAAAAGAAGCTCAAGGAAGAGAAACGAAACAAGAGCAAAAAGAAATAATA 
AATAATAAATTTTAAAAAACTTA 
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CCGAAAGTCCCGTCCGGACCCrCCAAGTGGAGACCCTGGTGGAGCCCCCAGAACC^^^ 

GAGCCCGCTGCTTTTGGAGACACGCTTCACATACACTACACGGGAAGCTTGGTAGATC^^^ 

TATTATTGACACCTCCCTGACCAGAGACCCTCTGGTTATAGAACTTGGCCAAAAGCAG^^^^ 

TTCCAGGTCTGGAGCAGAGTCTTCTCGACATCTGTGTGGGAGAGAAGCGAAGGGCAAT^^^ 

CCTTCTCACTTGGCCTATGGAAAACGGGGATTTCCACCATCTGTCCCAGCGGATGCAGTGGT 

GCAGTATGACGTGGAGCTGATTGCACTAATCCGAGCCAACTACTGGCTAAAGCTGGTGAAGG 

GCATTTTGCCTCTGGTAGGGATGGCCATGGTGCCAGCCCTCCTGGGCCTCATTGGGTATCAC 

CTATACAGAAAGGCCAATAGACCCAAAGTCTCCAAAAAGAAGCTCAAGGAAGAGAAACGAAA 
CAAGAGCAAAAAGAAATAATAAATAATAAATTTTAAAAAACTTAAAA 
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FIGURE Sli 

CTGCTGCATCCGGGTGTCTGGAGGCTGTGGCCGTTTTGTTTTCTTGGCTAAAATCGGG^^^ 

TGAGGCGGGCCGGCGCGGCGCGACACCX^CTCCGGAACCACTGCACGACGGGGCTGGACTG 

ACCTGAAAAAAASGTCTGGATTTCTAGAGGGCTTGAGATGCTCAGAATGCATTGACTGGGGG 

GAAAAGCGCAATACTATTGCTTCCATTGCTGCTGGTGTACTATTTTTTACAGGCTGGTGGAT 

TATCATAGATGCAGCTGTTATTTATCCCACCATGAAAGATTTCAACCACTCATACCATGCCT 

GTGGTGTTATAGCAACCATAGCCTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGA 

GGTGATAGTTACAGTGAAGGTTGTCTGGGTCAAACAGGTGCTCGCATTTGGCTTTTCGTTGG 

TTTCATGTTGGCCTTTGGATCTCTGATTGCATCTATGTGGATTCTTTTTGGAGGTTATGTTG 

CTAAAGAAAAAGACATAGTATACCCTGGAATTGCTGTATTTTT.CCAGAATGCCTTCATCTTT 

TTTGGAGGGCTGGTTTTTAAGTTTGGCCGCACTGAAGACTTATGGCAGTGAACACATCTGAT 

TTCCCACAGCACAACAGCCCTGCATGGGTTTGTTTGTTTTTTTACTGCTCACTCCCAACCTT 

TTGTAATGCCATTTTCTAAACTTATTTCTGAGTGTAGTCTCAGCTTAAAGTTGTGTAATACT 

AAAATCACGAGAACACCTAAACAACAACCAAAAATCTATTGTGGTATGCACTTGATTAACTT 

ATAAAATGTTAGAGGAAACTTTCACATGAATAATTTTTGTCAAATTTTATCATGGTATAATT 

TGTAAAAATAAAAAGAAATTACAAAAGAAATTATGGATTTGTCAATGTAAGTATTTGTCATA 

TCTGAGGTCCAAAACCACAATGAAAGTGCTCTGAAGATTTAATGTGTTTATTCAAATGTGGT 

CTCTTCTGTGTCAAATGTTAAATGAAATATAAACATTTTTTAGTTTTTAAAATATTCCGTGG 

TCAAAATTCTTCCTCACTATAATTGGTATTTACTTTTACCAAAAATTCTGTGAACATGTAAT 

GTAACTGGCTTTTGAGGGTCTCCCAAGGGGTGAGTGGACGTGTTGGAAGAGAGAAGCACCAT 

GGTCCAGCCACCAGGCTCCCTGTGTCCCTTCCATGGGAAGGTCTTCCGCTGTGCCTCTCATT 

CCAAGGGCAGGAAGATGTGACTCAGCCATGACACGTGGTTCTGGTGGGATGCACAGTCACTC 
CACATCCACCACTG 
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FIGURE j>7 

MSGFLEGLRCSECIDWGEKRNTIASIAAGVLFFTGWWI I IDAAVIYPTMKDFNHSYHACGVI 

ATIAFLMINAVSNGQVRGDSYSEGCLGQTGARIWLFVGFMLAFGSLIASMWILFGGYVAKEK 
DIVYPGIAVFFQNAFIFFGGLVFKFGRTEDLWQ 
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FIGURE 

TTCTT«;CTi^TCGGGGGAGTGAGGC<KK3CCGGCGCGGCGCGACACCGGGCTCC^^ 

ACTGCACGACGGGGCTGGACTGACCTGAAAAAAATGTCTGGATTTCTAGAGGGCTTGAGATG 

CTCAGAATGCATTGACTGGGGGGAAAAGCGCAATACTATTGCTTCCATTGCTGCTGGTGTAC 

TATTTTTTACAGGCTGGTGGATTATCATAGATGCAGCTGTTATTTATCCCACCATGAAAGAT 

TTCAACCACTCATACCATGCCTGTGGTGTTATAGCAACCATAGCCTTCCTAATGATTAATGC 

AGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAAGGTTGTCTGGGTCAAACAGGTG 

CTCGCATTTGGCTTTTCGTTGGTTTCATGTTGGCCTTTGGATCTCTGATTGCATCTATGTGG 

ATTCTTTTTGGAGGTTATGTTGCTAAAGAAAAAGACATAGTATACCCTGGAATTGCTGTATT 
TTTCCAGAATGCCTTCATCTTTTTTGGAGGGCTGGTTTTTAAGTTTGGC 
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TGGACGGACCTGAAAAAAATGTTTGGATTTNTAGAGGGNTTGAGATGTTCAGAATGCATGAC 
TGGGGGAAAAGCGCAAATACTATTGCTTCCATTGCTGCTGGTGTANTATTTTTTACAGGCTG 
GTGGATTATCATAGATGCAGNTGTTATTTATCCCACCATGAAAGATTTCAACCANTCATACC 
ATGCCTGTGGTGTTATAGCAACCATAGCCTTCNTAATGATTAATGCAGTATCGAATGGACAA 
GTCCGAGGTGATAGTTACAGTGAAGGTTGTTTGGGTCAAACAGGTGCTCGCATTTGGCTTTT 
CGTTGGTTTCATGTTGGCCTTTGGATCTCTGATTGCATCTATGTGGATTCTTTTTGGAGGTT 
ATGTTGCTAAAGAAAAAGACATAGTATACCCTGGAATTGNTGTATTTTTCCAGAATGCCTTC 
ATCTTTTTTGGAGGGCTGGTTTTTAAGTTTGGCCGCACTGAAGANTTATGGCAGTG 
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GGACACCGGGTTCCGGACCAATGCANGAOSGGGTCSGANTGACCTGAAAAAAATGTTTGGATT 

TTTAGAGGGCTTGAGATGNTCAGAATGCATTGACTGGGGGAAAAGCGCAATANTATTGCTTT 

CCATTGCTGCTGGTGTACTATTTTTTACAGGGTGGTGGATTATCATAGATGCAGCTGTTATT 

TATCCCACCATGAAAGATTTNAACCACTCATACCATGCCTGTGGTGTTATAGCAACCATAGC 

CTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAAGGTT 

GTTTGGGTCAAACAGGTGNTCGCATTTGGCTTTTCGTTGGTTTCATGTTGGCCTTTGGATTT 

CTGATTGNATTCTATGCGGATTCTTCTTGGAGGTTATGTTGCTAAAGAAAAAGACATAGTAT 
ACCCTGGAATTNCTNTATTTTTCCAGAATGCC 
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FIGURE 61 

TAGAGGGCrTGAGATGCTCAGAATGCATTGACTGGG«3GAAAAGCGCAATANTATT^^^^ 

ATTGNTGNTGGTGTANTATTrrTTTACAGGCTGGTGGATTATNATAGATGCAGCTGTTATTT 
ATCCCACCATGAAAGATTT^AACCANTCATACCATGCCTGTGGTGTTATAG 

TTCCTAATGAITAATGCAGTA^GAATGGACAAGTCCGAGGTGATAGTTACAGTGAAGG^^ 
TTTGGGTCAAACAGGTGNTNGCATTTGGCTTTTNGTTGGTrrCATGTTGGCC™^^ 

TGATTGCATTTATGTGGATTNTrrrrGGAGGTTATGTTGCTAAAGNAAAAGACATAGTATAC 
CCTGT 
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FIGIJRK ^2 

GGGAGGCTGTGNCCGTT1TGTTTTNTTGGCTA7^TCG«5CK5AGTGAGGCGGCCCGGC^^^ 

CGNGACACCGGGTTCCGGGAACCATTGCACGACGGGGTGGACTGACCTGAAAAAAATGTTTG 

GATTTNTAGAGGGCTTGAGATGCTCAGAATGCATTGACTGGGGGGAAAAGCGCAATACTATT 

GCTTCCATTGCTGCTGGTGTACTATTTTTTACAGGCTGGTGGATTATCATAGATGCAGCT^ 

TATTTATCCCACCATGAAAGATTTCAACCACTCATACCATGCCTGTGGTGTTATAGCAACCA 

TAGCCTTCCTAATGATTAATGCAGTATCGAATGGACAAGTCCGAGGTGATAGTTACAGTGAA 

GGTTGTCTGGGTCAAACAGGTGCTCGCATTTGGCTTTTCGTTGGTTTCATGTTGGCCTTTGG 

ATNTCTGATTGCATCTATGTGGATTCTTTTTGGAGGTTATGTTGCTAAAGAAAAAGACATAG 

TATACCCTGGAATTGCTGTATTTTTCCAGAATGCCTTCATNTTTTTTGGAGGGCTC 
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CGACGCCGGCGTGATGTGGCTTCCGCTGGTGCTGCTCCTGGCTGTGCTGCTGCTGGCCGTCC 

TCTGCAAAGTTTACTTGGGACTATTCTCTGGCAGCTCCCCGAATCCTTTCTCCGAAGATGTC 

AAACGGCCCCCAGCGCCCCTGGTAACTGACAAGGAGGCCAGGAAGAAGGTTCTCAAACAAGC 

TTTTTCAGCCAACCAAGTGCCGGAGAAGCTGGATGTGGTGGTAATTGGCAGTGGCTTTGGGG 

GCCTGGCTGCAGCTGCAATTCTAGCTAAAGCTGGCAAGCGAGTCCTGGTGCTGGAACAACAT 

ACCAAGGCAGGGGGCTGCTGTCATACCTTTGGAAAGAATGGCCTTGAATTTGACACAGGAAT 

CCATTACATTGGGCGTATGGAAGAGGGCAGCATTGGCCGTTTTATCTTGGACCAGATCACTG 

AAGGGCAGCTGGACTGGGCTCCCCTGTCCTCTCCTTTTGACATCATGGTACTGGAAGGGCCC 

AATGGCCGAAAGGAGTACCCCATGTACAGTGGAGAGAAAGCCTACATTCAGGGCCTCAAGGA 

GAAGTTTCCACAGGAGGAAGCTATCATTGACAAGTATATAAAGCTGGTTAAGGTGGTATCCA 

GTGGAGCCCCTCATGCCATCCTGTTGAAATTCCTCCCATTGCCCGTGGTTCAGCTCCTCGAC 

AGGTGTGGGCTGCTGACTCGTTTCTCTCCATTCCTTCAAGCATCCACCCAGAGCCTGGCTGA 

GGTCCTGCAGCAGCTGGGGGCCTCCTCTGAGCTCCAGGCAGTACTCAGCTACATCTTCCCCA 

CTTACGGTGTCACCCCCAACCACAGTGCCTTTTCCATGCACGCCCTGCTGGTCAACCACTAC 

ATGAAAGGAGGCTTTTATCCCCGAGGGGGTTCCAGTGAAATTGCCTTCCACACCATCCCTGT 

GATTCAGCGGGCTGGGGGCGCTGTCCTCACAAAGGCCACTGTGCAGAGTGTGTTGCTGGACT 

CAGCTGGGAAAGCCTGTGGTGTCAGTGTGAAGAAGGGGCATGAGCTGGTGAACATCTATTGC 

CCCATCGTGGTCTCCAACGCAGGACTGTTCAACACCTATGAACACCTACTGCCGGGGAACGC 

CCGCTGCCTGCCAGGTGTGAAGCAGCAACTGGGGACGGTGCGGCCCGGCTTAGGCATGACCT 

CTGTTTTCATCTGCCTGCGAGGCACCAAGGAAGACCTGCATCTGCCGTCCACCAACTACTAT 

GTTTACTATGACACGGACATGGACCAGGCGATGGAGCGCTACGTCTCCATGCCCAGGGAAGA 

GGCTGCGGAACACATCCCTCTTCTCTTCTTCGCTTTCCCATCAGCCAAAGATCCGACCTGGG 

AGGACCGATTCCCAGGCCGGTCCACCATGATCATGCTCATACCCACTGCCTACGAGTGGTTT 

GAGGAGTGGCAGGCGGAGCTGAAGGGAAAGCGGGGCAGTGACTATGAGACCTTCAAAAACTC 

CTTTGTGGAAGCCTCTATGTCAGTGGTCCTGAAACTGTTCCCACAGCTGGAGGGGAAGGTGG 

AGAGTGTGACTGCAGGATCCCCACTCACCAACCAGTTCTATCTGGCTGCTCCCCGAGGTGCC 

TGCTACGGGGCTGACCATGACCTGGGCCGCCTGCACCCTTGTGTGATGGCCTCCTTGAGGGC 

CCAGAGCCCCATCCCCAACCTCTATCTGACAGGCCAGGATATCTTCACCTGTGGACTGGTCG 

GGGCCCTGCAAGGTGCCC,TGCTGTGCAGCAGCGCCATCCTGAAGCGGAACTTGTACTCAGAC 

CTTAAGAATCTTGATTCTAGGATCCGGGCACAGAAGAAAAAGAATTAGTTCCATCAGGGAGG 

AGTCAGAGGAATTTGCCCAATGGCTGGGGCATCTCCCTTGACTTACCCATAATGTCTTTCTG 

CATTAGTTCCTTGCACGTATAAAGCACTCTAATTTGGTTCTGATGCCTGAAGAGAGGCCTAG 

TTTAAATCACAATTCCGAATCTGGGGCAATGGAATCACTGCTTCCAGCTGGGGCAGGTGAGA 

TCTTTACGCCTTTTATAACATGCCATCCCTACTAATAGGATATTGACTTGGATAGCTTGATG 

TCTCATGACGAGCGGCGCTCTGCATCCCTCACCCATGCCTCCTAACTCAGTGATCAAAGCGA 

ATATTCCATCTGTGGATAGAACCCCTGGCAGTGTTGTCAGCTCAACCTGGTGGGTTCAGTTC 

TGTCCTGAGGCTTCTGCTCTCATTCATTTAGTGCTACGCTGCACAGTTCTACACTGTCAAGG 

GAAAAGGGAGACTAATGAGGCTTAACTCAAAACCTGGGCGTGGTTTTGGTTGCCATTCCATA 

GGTTTGGAGAGCTCTAGATCTCTTTTGTGCTGGGTTCAGTGGCTCTTCAGGGGACAGGAAAT 

GCCTGTGTCTGGCCAGTGTGGTTCTGGAGCTTTGGGGTAACAGCAGGATCCATCAGTTAGTA 

GGGTGCATGTCAGATGATCATATCCAATTCATATGGAAGTCCCGGGTCTGTCTTCCTTATCA 

TCGGGGTGGCAGCTGGTTCTCAATGTGCCAGCAGGGACTCAGTACCTGAGCCTCAATCAAGC 

CTTATCCACCAAATACACAGGGAAGGGTGATGCAGGGAAGGGTGACATCAGGAGTCAGGGCA 

TGGACTGGTAAGATGAATACTTTGCTGGGCTGAAGCAGGCTGCAGGGCATTCCAGCCAAGGG 

CACAGCAGGGGACAGTGCAGGGAGGTGTGGGGTAAGGGAGGGAAGTCACATCAGAAAAGGGA 

AAGCCACGGAATGTGTGTGAAGCCCAGAAATGGCATTTGCAGTTAATTAGCACATGTGAGGG 

TTAGACAGGTAGGTGAATGCAAGCTCAAGGTTTGGAAAAATGACTTTTCAGTTATGTCTTTG 

GTATCAGACATACGAAAGGTCTCTTTGTAGTTCGTGTTAATGTAACATTAATAAATTTATTG 
ATTCCATTGCTTTAAAAAAAAAAAAAAA 
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MWLPLVLLLAVLLLAVLCKVYLGLFSGSSPNPFSEDVKRPPAPLVTDKEARKKVLKQAFSAN 

QVPEKLDVWIGSGFGGLAAAAILAKAGKRVLVLEQHTKAGGCCHTFGKNGLEFDTGIHYIG 

RMEEGSIGRFILDQITEGQLDWAPLSSPFDIMVLEGPNGRKEYPMYSGEKAYIQGLKEKFPQ 

EEAIIDKYIKLVKWSSGAPHAILLKFLPLPWQLLDRCGLLTRFSPFLQASTQSLAEVLQQ 

LGASSELQAVLSYI FPTYGVTPNHSAFSMHALLVNHYMKGGFYPRGGSSEIAFHTI PVIQRA 

GGAVLTKATVQSVLLDSAGKACGVSVKKGHELVNIYCPIWSNAGLFNTYEHLLPGNARCLP 

GVKQQLGTVRPGLGMTSVFICLRGTKEDLHLPSTNYYVYYDTDMDQAMERYVSMPREEAAEH 

I PLLFFAFPSAKDPTWEDRFPGRSTMIMLI PTAYEWFEEWQAELKGKRGSDYETFKNSFVEA 

SMSWLKLFPQLEGKVESVTAGSPLTNQFYLAAPRGACYGADHDLGRLHPCVMASLRAQSPI 

PNLYLTGQDIFTCGLVGALQGALLCSSAILKRNLYSDLKNLDSRIRAQKKKN 
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FIGURE 6.^ 

GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCTA 

GGGGTTGGCACCGGCCCCGAGAGGAGGATCCGGGTCCGGATAGGGCTGACGCTGCTGCTGTG 

TGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGGATGAAGAAGGCAGCCAGGATGAATCCT 

TAGATTCCAAGACTACTTTGACATCAGATGAGTCAGTAAAGGACCATACTACTGCAGGCAGA 

GTAGTTGCTGGTCAAATATTTCTTGATTCAGAAGAATCTGAATTAGAATCCTCTATTCAAGA* 

AGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAGAAGATATCAGCTTTCTAG 

AGTCTCCAAATCCAGAAAACAAGGACTATGAAGAGCCAAAGAAAGTACGGAAACCAGCTTTG 

ACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCTTTTCCTAGATAA 

GGAGTATGATGAATGTACATCAGATGGGAGGGAAGATGGCAGACTGTGGTGTGCTACAACCT 

ATGACTACAAAGCAGATGAAAAGTGGGGCTTTTGTGAAACTGAAGAAGAGGCTGCTAAGAGA 

CGGCAGATGCAGGAAGCAGAAATGATGTATCAAACTGGAATGAAAATCCTTAATGGAAGCAA 

TAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 

CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATC 

CAGGCAGCGAGAGAGATGTTTGAGAAGCTGACTGAGGAAGGCTCTCCCAAGGGACAGACTGC 

TCTTGGCTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTAT 

ATTATACATTTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTT 

lAGTGGAAGGCTAATAATATTAACATCAGAAGAATTTGTGGTTTATAGCGGCCACAACTTTT 

TCAGCTTTCATGATCCAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAA 

ATTCTTGTTAATGGATATAACACATGGAATCTACATGTAAATGAAAGTTGGTGGAGTCCACA 

ATTTTTCTTTAAAATGATTAGTTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGC 

TCTTTTTAAATTTTCTCTGAGTTGGAATTGTCAGAATCATTTTTTACATTAGATTATCATAA 

TTTTAAAAATTTTTCTTTAGTTTTTCAAAATTTTGTAAATGGTGGCTATAGAAAAACAACAT 

GAAATATTATACAATATTTTGCAACAATGCCCTAAGAATTGTTAAAATTCATGGAGTTATTT 

GTGCAGAATGACTCCAGAGAGCTCTACTTTCTGTTTTTTACTTTTCATGATTGGCTGTCTTC 

CCATTTATTCTGGTCATTTATTGCTAGTGACACTGTGCCTGCTTCCAGTAGTCTCATTTTCC 

CTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGGAAGATTAACTCATTTTTAATAAA 

ATTATGTCTAAGATTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAA 



BNSOOCtO: <wo_sgsxeaA2j.> 



wo 99/63088 PCr/US99/12252 

FIGURE 66 

MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLD 
SEESELESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHG 
EPCHFPFLFLDKEYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMM 
YQTGMKILNGSNKKSQKREAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEK 
LTEEGSPKGQTALGFLYASGLGVNSSQAKALVYYTFGALGGNLIAHMVLVSRL 
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FIGURE 67 

CTTCCCAGCCCTGTGCCCCAAAGCACCTGGAGCATATAGCCTTGCAGAACTTCTACTTGCCT 

GCCTCCCTGCCTCTGGCC^CCTGCCGGTGCCTCAGCTTCCTTCTGATGGGGACCTTCCT 

GTCAGTTTCCCAGACAGTCCTGGCCCAGCTGGATGCACTGCTGGTCTTCCCAGGCCAAGTGG 

CTCAACTCTCCTGCACGCTCAGCCCCCAGCACGTCJVCCATCAGGGACTACGGTGTGTCCTGG 

TACCAGCAGCGGGCAGGCAGTGCCCCTCGATATCTCCTCTACTACCGCTCGGAGGAGGATCA 

CCACCGGCCTGCTGACATCCCCGATCGATTCTCGGCAGCCAAGGATGAGGCCCACAATGCCT 

GTGTCCTCACCATTAGTCCCGTGCAGCCTGAAGACGACGCGGATTACTACTGCTCTGTTGGC 

TACGGCTTTAGTCCCTAGGGGTGGGGTGTGAGATGGGTGCCTCCCCTCTGCCTCCCATTTCT 

GCCCCTGACCTTGGGTCCCTTTTAAACTTTCTCTGAGCCTTGCTTCCCCTCTGTAAAATGGG 
TTAATAATATTCAACATGTCAACAAC 
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FIGURK i{f< 

MACRCLSFLI^GTFLSVSQTVLAQLDALLVFPGQVAQLSCTLSPQHVTIRDYGVSWYQQRAG 
SAPRYLLYYRSEEDHHRPADIPDRFSAAKDEAHNACVLTISPVQPEDDADYYCSVGYGFSP 
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FIGURE m 

SJSJ^^S^^^^^^^^^'^^^^aacggcttcggcagcctgagStSXct^ 
^^^^SoIS^^^™^^^^gggaaggagagcctggggcccgaSgotS^^ 
^SS^S^^cgccagccagcagtgggcacgaccgcgcttcacacScS^ 
SS^^I^^^tcgcacggcccgtgggtagctccgtgcggctcaagt^^^ 

gtttgtggtgctgcccacgggtgacgtgtggtcgcggcccScS^^tc^^^ 
acctgtggcctcctcgtcctcggccactagcctgc?g^^ 



ATATTGCCTGGACACACACACACACACACGCGTGCAcSTirGCTG^^ 



SfJS^n^^oS^°^^^°<^^^^^^ACACAGATAATGCTGCCTCAACACTCASc^^^ 

JS^^^SS^ggacacacacatgtgcacagatatgctgtctggacatcSSS???^^ 

^?SI?TSS2^I^''^^^^^^^^^^A^CATGCAGATATC^^ 
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FIGURE 7n 

MTPSPLLLLLLPPLLLGAFPPAAAARGPPKMADKWPRQVARLGRTVRLQCPVEGDPPPLTM 

WTKDGRTIHSGWSRFRVLPQGLKVKQVEREDAGVYVCKATNGFGSLSVNYTLWLDDISPGK 

ESLGPDSSSGGQEDPASQQWARPRFTQPSKMRRRVIARPVGSSVRLKCVASGHPRPDITOMK 

DDQALTRPET^PRKKKWTLSLKNLRPEDSGKYTCRVSNRAGAINATYKVDVIQRTRSKPVL 

TGTHPVNTTVDPGGTTSFQCKVRSDVKPVlQWLKRVEyGAEGRHNSTIDVGGQKFVVLPTGD 

VWSRPDGSYLNKLLITRARQDDAGMYICLGANTMGYSFRSAFLTVLPDPKPPGPPVASSSSA 

TSLPWPWIGIPAGAVFILGTLLLWLCQAQKKPCTPAPAPPLPGHRPPGTARDRSGDKDLPS 

LAALSAGPGVGLCEEHGSPAAPQHLLGPGPVAGPKLYPKLYTDIHTHTHTHSHTHSHVEGKV 
HQHIHYQC 
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FIGURE 71 A 

CCCAGCTGAGGAGCCCTGCTCAAGACACGGTCACTGGATCTGAGAAACTTCCCAGGGGACCG 
CATTCCAGAGTCAGTGACTCTGTGAAGCACCCACATCTACCTCT^ 

?????GSSS^^J??S?f^SS^"°^'^^'^"^^'^^^TTCCTGGTCCTGGAAGT^^^ 

^CAGGGAGCAGCGGCCTGGCCAGAACTGCTCTAATTACACCGTACGCTTCCT^ 
GGATCCCTGCGCCGAGACACAGAGCGCATCTGGAGCCCATGGTCTCCCTGGAGCAAGTGCTC 
AGCTGCCTGTGGTCAGACTGGGGTCCAGACTCGCACACGCATTTGCTT^^ 
^^^^S^^^^^^^^^G^CAGCGAAGAGGGTCAGCACTGCATGGGCCAGGA^^ 

gacctgacctgcccaatgggccaggtgaatgctgactgtgatgcctgcaSt^^ 

S^IS^TJS^TGGGGOTGTCTCCCTTCCCGGAGGTGCCCCAGCC^^^ 

tcctgaccaagacgccgaagctgctgacccagacagacagtgatgggagattcSSccct 

GGCTTGTGCCCTGATGGCAAAAGCATCCTGAAGATCACAAAGGTC^^^ 

^S^^T^^A^^^^^^^'^^^'^^AACCCTGAGACAAAAGCACGGAGAGCTGGGCAG^^ 
CTGTGCTGTAAGGCCACAGGGAAGCCCAGGCCAGACAAGTATTTTTGGTATCAT^^^ 

attgctggatccttccctctacaagcatgagagcaagctggtgctgaggSct^^ 

gcccagctgattgtcacagcatctgatgagactccttgcaacccagttcctcagagctSct 

tatccggctgccccatgattgctttcagaatgccaccaactccttSSatgac^ 

gctgccctgttaagacttgtgcagggcagcaggataatgVgSSggtgc?^^^ 

fS??^^^??JS^^^^^^^^°ACAGAGGAAAGGGAGA?cSJ^^ 

acccaccaaggtggccaaggagtgcagctgccagcggtgtacggaaactcgg^^ 

AACAGCCGTGTAAGCATGACTGGCTACAAGGGCACTTTCACCCTCCATGTCCCCCAGrA^ 
TGAGAGGCTGGTGCTCACATTTGTGGACAGGCTGCAGAi^GTTTG^^^ 

SSg^^??^^^^^^^ 

^^aggaaaagtgaaggccagtgtgaccttcctggatccccggSStt^^^ 

gcccagactgacctgaacttcatcaatgacgaaggagacactttccccottcggaSta^^^ 

catgttctctgtggacttcagagatgaggtcacctcagagccactStgctggcS 

aggtccaccttgactcgacccaggtcaagatgccagagcacatatcc^^^ 

tcactcaatccagacacagggctgtgggaggaggaaggtgatitcSat^^^ 

gaggaacaaaagagaagacagaaccttcctggtgggcaacctgga^t^^^ 

TCTTTAACCTGGATGTTCCTGAAAGCAGGCGGTGCTTTCTTilG^^^ 

gagaggttcttgcctagtgagcagatccagggggttgtgatctcStg^^^ 
tagaactggcttcttgtccaaccctagggcctggggccgctttScag^^^ 
ccaacggggcctgtgtgcctgccttctgtgatgaccagtcccctgat^^^ 
gtcttggcaagcctggctggggaggaactgcaagcagtggagtcttctStaSttcaIc^ 

?5^;fS^^^^°^^^^^^^'^CA^CCCTATCTCAACAAGCTCAA6T^^^ 

AGGATCCACGGGTTAAAAAGACAGCTTTCCAGATTAGCATGGCCAASciAGGCCCAScTrA 
?S!??2S^^A^^^^^^°^^^^TCTATGCCTTTGAGAACCTCcS^ 

JSSJSSI^cagcccacttccggttctaccagattgagggggatcgatatSct^^ 

TCCCCTTCAACGAAGATGACCCTATGAGCTGGACTGAAGACTATCTGGCATGGTGrrrA^ 

GCGATCCCGCAACATGGGGGGCACTCATCGGCGGACAGTGGGGAAGCTGTA^GGAATrrrar 
ATGTGAGGAGCACTCGGGACAGGGACCAGCCCAATGTCTCA^^^^ 

tgcagtgggatgctctatgatcaggaccgtgtggaccgcaccctggtgS^^ 

GGGCAGCTGCCGTCGAGCCAGTGTGAACCCCATGCTGCATGAG^^ 
CACTTGCAGTCAACAACGACACCAGTGAGTACACCATGCTGGCACCCTT^^^ 

CGGCCGGTGCTTTGATGGCACATCCGATGGCTCCTCCAGAATCATGAAGAGCAATGTrSJr 

tagccctcaccttcaactgtgtagagaggcaagtaggccgccSa^^ 

?SJ?o^^^^^''''^^^°'^^^^^T°^'rGCAGGCACTGTCCAAGG^GAGTC^^ 
GCAGCAGCGAGCGAGCAGGGGTGGCCAGCGCCAGGGTGGAGTGGTGGCCT^^^^^ 
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MVGTKAWVFSFLVLEVTSVLGRQTMLTQSVRRVQPGKKNPSIFAKPADTLESPGEWTTWFNI 
DYPGGKGDYERLDAIRFYYGDRVCARPLRLEARTTDWTPAGSTGQWHGSPREGFWCLNREQ 
RPGQNCSNYTVRFLCPPGSLRRDTERIWSPWSPWSKCSAACGQTGVQTRTRICLAEMVSLCS 
EASEEGQHCMGQDCTACDLTCPMGQVNADCDACMCQDFMLHGAVSLPGGAPASGAAIYLLTK 
TPKLLTQTDSDGRFRIPGLCPDGKSILKITKVKFAPIVLTMPKTSLKAATIKAEFVRAETPY 
MVMNPETKARRAGQSVSLCCKATGKPRPDKYFWYHNDTLLDPSLYKHESKLVLRKLQQHQAG 
EYFCKAQSDAGAVKSKVAQLIVTASDETPCNPVPESYLIRLPHDCFQNATNSFYYDVGRCPV 
KTCAGQQDNGIRCRDAVQNCCGISKTEEREIQCSGYTLPTKVAKECSCQRCTETRSIVRGRV 
SAADNGEPMRFGHVYMGNSRVSMTGYKGTFTLHVPQDTERLVLTFVDRLQKFVNTTKVLPFN 
KKGS AVFHE I KMLRRKEP ITLEAMETNI I PLGEWGEDPMAELE I PSRSFYRQNGEP Y IGKV 

KASVTFLDPRNISTATAAQTDLNFINDEGDTFPLRTYGMFSVDFRDEVTSEPLNAGKVKVHL 
DSTQVKMPEHISTVKLWSLNPDTGLWEEEGDFKFENQRRNKREDRTFLVGNLEIRERRLFNL 
DVPESRRCFVKVRAYRSERFLPSEQIQGWISVINLEPRTGFLSNPRAWGRFDSVITGPNGA 
CVPAFCDDQSPDAYSAYVLASLAGEELQAVESSPKFNPNAIGVPQPYLNKLNYRRTDHEDPR 
VKKTAFQI SMAKPRPNSAEESNGP I YAFENLRACEEAPPSAAHFRFYQIEGDRYDYNTVPFN 
EDDPMSWTEDYLAWWPKPMEFRACYIKVKIVGPLEVNVRSRNMGGTHRRTVGKLYGIRDVRS 
TRDRDQPNVSAACLEFKCSGMLYDQDRVDRTLVKVIPQGSCRRASVNPMLHEYLVNHLPLAV 
NNDTSEYTMLAPLDPLGHNYGIYTVTDQDPRTAKEIALGRCFDGTSDGSSRIMKSNVGVALT 

FNCVERQVGRQSAFQYLQSTPAQSPAAGTVQGRVPSRRQQRASRGGQRQGGWASLRFPRVA 
QQPLIN 
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CTGCAAGTTGTTAACGCCTAACACACAAGTATGTTAGGCTTCCACCAAAGTCCTCAATATAC 
CTGAATACGCACAATATCTTAACTCTTCATATTTGGTTTTGG^^^^ 

cttcatttaaaaaaaaatacagagacctacctacccgtacgcataStSS^^ 

ATATGTAAACTAGACAAAGATCGCAGATCATAAAGCAAGCTC^GcS^^^^ 

TTACAAAGAATTTAGAGaSSTATTTGTCAAGATCCCTGTCGATTCATGCCCTTTGGGTTACG 

GTGTCCTCAGTGATGCAGCCCTACCCTTTGGTTTGGGGACATTATGA^^^^ 

GATTTACACGGAAGAAGGGAAAGTTTGGGArrACATGGCCTGCCAGCCGGS^^ 

TGACAAAATATCTGAAAGTGAAACTCGATCCTCCGGATATTACCTGTGGAGAC^^^ 

ACGTTCTGTGCAATGGGCAATCCCTACATGTGCAATAATGAGTGTGATGCG^^^^ 

GCTGGCACACCCCCCTGAGCTGATGTTTGATTTTGAAGGAAGACATCCCT^^ 

agtctgccacttggaaggagtatcccaagcctctccaggttaaStSSc^^SS^^^ 

aaaaccattgagctaacagacaacatagttattacctttgaatctgggS?^^^ 

gatcctggagaagtctctcgattatggacgaacatggcagccctatcagtatS?g^^ 

actgcttagatgcttttcacatggatcctaaatccgtgaaggatttatSSS^^^ 

ttagaaatcatttgcacagaagagtactcaacagggtatacaacaa^^SS^?^^ 

ctttgaaatcaaagacaggttcgcgctttttgctggacctcgcctacgcaatatcg^^^ 

tctacggacagctggatacaaccaagaaactcagagatttctttacS^^^S^^^^ 

ataaggctgttaagaccagccgttggggaaatatttgtagatgagSaSc^^^^ 

tgtgggaaatgcaagaagaattatcagggccgaccttggagtccaggctccta?SS^^ 
ccccaaaggcactgcaaatacctgtatccccagtatttccagtattgg^aS^^^^ 
acaacgagctcctgcactgccagaacggagggacgtgccacaacaaStgcgctgS^^ 
ccggccgcatacacgggcatcctctgcgagaagctgcggtgcgaggagS^^ 

ctccgactctggccagggcgcgcccccgcacggcaccccagcgctgctgSg^Sa^^^^ 

g?g™ggS?^^^^^ 

cacccgtggacagcacatccgagtcaagactgttaatttctgactccagagSStggSg^ 

TGTTGATATTATCACTGCAAATCACATTGCCA^^^^^^ 

GCTACTCTAGCGTGGTGCGCCCTAGTACGACTCCGCCCAGTGTGTGGACCAACCAA^?^^^^ 

TTCTTTGCTGTCAGGTGCATTGTGGGCATAAGGAAATCTGTTACAAGC^^^ 

GCTTCCGTCCCTGAATCCCTTCCAACCTGTGCTTTAGTGAACGTTGCTCTG?^^^^ 

GGTTGAAAGATTTCTTTGTCTGATGTTAGTGATGCACATGTGTAAcScC^^^ 

GCAAGCCAGTCATACCCCTGTATATCITAGCAGCACTGAGTCCA^ 

tatacaagagtggctataggaaaaaagaaagtgtatctatccttttgtattSS?g^^^ 
caatctgttaatgtatctaattcgaatcagcaaagactgacattttattttgtcSot^^ 

TTCTTGTTGCTTTAACACTGGAAGATTTAAAGAATAAAAACTCCTGCATSAC^Im^^ 

actgatttctgtgtggactgagtacattcagctgacgaatttagttcccaggaaStgg^^ 
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MYLSRSLSIHALWVTVSSVMQPYPLVWGHYDLCKTQIYTEEGKVWDYMACQPESTDMTKYLK 

VKLDPPDITCGDPPETFCAMGNPYMCNNECDASTPELAHPPELMPDFEGRHPSTFWQSATWK 

EYPKPLQVNITLSWSKTIELTDNIVITFESGRPDQMILEKSLDYGRTWQPYQYYATDCLDAF 

HMDPKSVKDLSQHTVLEIICTEEYSTGYTTNSKIIHFEIKDRFALFAGPRUINMASLYGQLD 

TTKKLRDFFTVTDLRIRLLRPAVGEIFVDEIJIIJUIYFYAISDIKVRGRCKCNLHATVCVYDN 

SKLTCECEHNTTGPDCGKCKKNYQGRPWSPGSYLPIPKGTANTCIPSISSIGTNVCDNELLH 

CQNGGTCHNNVRCLCPAAYTGILCEKLRCEEAGSCGSDSGQGAPPHGTPALLLLTTLLGTAS 
PLVF 
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CCCACGCGTCCGGGTGACCTC3GGCCGAGCCCTCCCGGTCGGCTAAGATTGCTGAGGAGGCGG 

CGGGTAGCTGGCAGGCGCCGACTTCCGAAGGCCGCCGTCCGGGCGAGGTGTCCTCATGACTT 

CTCTTGTGGACCATGTCCGTGATCTTTTTTGCCTGCGTGGTACGGGTAAGGGATGGACTGCC 

CCTCTO^GCCTCTACTGATTTTTACCACACCCAAGATTTTTTGGAATGGAGGAGACGGCTCA 

AGAGTTTAGCCTTGCGACTGGCCCAGTATCCAGGTCGAGGTTCTGCAGAAGGTTGTGACTTT 

AGTATACATTTTTCTTCTTTCGGGGACGTGGCCTGCATGGCTATCTGCTCCTGCCAGTGTCC 

AGCAGCCATGGCCTTCTGCTTCCTGGAGACCCTGTGGTGGGAATTCACAGCTTCCTATGACA 

CTACCTGCATTGGCCTAGCCTCCAGGCCATACGCTTTTCTTGAGTTTGACAGCATCATTCAG 

AAAGTGAAGTGGCATTTTAACTATGTAAGTTCCTCTCAGATGGAGTGCAGCTTGGAAAAAAT 

TCAGGAGGAGCTCAAGTTGCAGCCTCCAGCGGTTCTCACTCTGGAGGACACAGATGTGGCAA 

ATGGGGTGATGAATGGTCACACACCGATGCACTTGGAGCCTGCTCCTAATTTCCGAATGGAA 

CCAGTGACAGCCCTGGGTATCCTCTCCCTCATTCTCAACATCATGTGTGCTGCCCTGAATCT 

CATTCGAGGAGTTCACCTTGCAGAACATTCTTTACAGGATCCAAGGAGCTGGTTCTGCTGGT 

TGGACCAAACCTCGTGAGCCAGCCACCCCTGACCCAAATGAGGAGAGCTCTGATTCTCCCAT 

CCGGGAGCAGTGATGTCAAACTTCTGCTGCTGGGGAAATCTCATCAGCAGGGAGCCTGTGGA 

AAAGGGCATGTCAGTGAAATCTGGGAATGGCTGGATTCGGAAACATCTGCCCATGTGTATTG 

ATGGCAGAGCTGTTGCCCACAAGCGCCTTTTATTTAGGGTAAAATTAACAAATCCATTCTAT 

TCCTCTGACCCATGCTTAGTACATATGACCTTTAACCCTTACATTTATATGATTCTGGGGTT 

GCTTCAGAAGTGTTATTTCATGAATCATTCATATGATTTGATCCCCCAGGATTCTATTTTGT 

TTAATGGGCTTTTCTACTAAAAGCATAAAATACTGAGGCTGATTTAGTCAGGGCAAAACCAT 

TTACTTTACATATTCGTTTTCAATACTTGCTGTTCATGTTACACAAGCTTCTTACGGTTTTC 

TTGTAACAATAAATATTTTGAGTAAATAATGGGTACATTTTAACAAACTCAGTAGTACAACC 

TAAACTTGTATAAAAGTGTGTAAAAATGTATAGCCATTTATATCCTATGTATAAATTAAATG 

AGGTGGCTTCAGAAATGGCAGAATAAATCTAAAGTGTTTATTAAAAAAAAAAAAAAAAAAAA 
AAAAG 
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FIGURE 76 

MSVIFFACWRVRDGLPLSASTDFYHTQDFLEWRRRLKSLALRLAQYPGRGSAEGCDFSIHF 
SSFGDVACMAI CSCQCPAAMAFCFLETLWWEFTASYDTTCIGLASRPYAFLEFDS I IQKVKW 
HFNYVSSSQMECSLEKIQEELKLQPPAVLTLEDTDVANGVMNGHTPMHLEPAPNFRMEPVTA 
LGILSLILNIMCAALNLIRGVHLAEHSLQDPRSWFCWLDQTS 
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FIGURE 77 

TGCTTCCTGGAGACCCTGTGGTGGGAATTOVCAGCTTCNTATGACACTACCTGCATTGGCNT 
AGCCTCCAGGCCATACGCTTTTCTTGAGTTTGAO^GCATCATTCAGAAAGTGAAGTTO 
TTAACTATGTAAGTTCCnOTCAGATGGAGTGCAGCTTGGAAAAAATTCAGGAGGAGCTCAAG 
TTGCAGCCTCCAGCGGTTCTCANTATGGAGGACACAGATGTGGCAAATGGGGT 
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FIGURE 7X 

CTCAGCGGCGCTTCCTCGTAGCGAGCCTAGTGGCGGGTGTTTGCATTGAAACGTGAGCGCGA 

CCCGACCTTAAAGAGTGGGGAGCAAAGGGAGGACAGAGCCCTTTAAAACGAGGCGGGTGGTG 

CCTGCCCCTTTAAGGGCGGGGCGTCCGGACGACTGTATCTGAGCCCCAGACTGCCCCGAGTT 

TCTGTCGCAGGCTGCGAGGAAAGGCCCCTAGGCTGGGTCTGGGTGCTTGGCGGCGGCGGCTT 

CCTCCCCGCTCGTCCTCCCCGGGCCCAGAGGCACCTCGGCTTCAGTCATGCTGAGCAGAGTA 

TGGAAGCACCTGACTACGAAGTGCTATCCGTGCGAGAACAGCTATTCCACGAGAGGATCCGC 

GAGTGTATTATATCAACACTTCTGTTTGCAACACTGTACATCCTCTGCCACATCTTCCTGAC 

CCGCTTCAAGAAGCCTGCTGAGTTCACCACAGTGGATGATGAAGATGCCACCGTCAACAAGA 

TTGCGCTCGAGCTGTGCACCTTTACCCTGGCAATTGCCCTGGGTGCTGTCCTGCTCCTGCCC 

TTCTCCATCATCAGCAATGAGGTGCTGCTCTCCCTGCCTCGGAACTACTACATCCAGTGGCT 

CAACGGCTCCCTCATCCATGGCCTCTGGAACCTTGTTTTTCTCTTCCCCAACCTGTCCCTCA 

TCTTCCTCATGCCCTTTGCATATTTCTTCACTGAGTCTGAGGGCTTTGCTGGCTCCAGAAAG 

GGTGTCCTGGGCCGGGTCTATGAGACAGTGGTGATGTTGATGCTCCTCACTCTGCTGGTGCT 

AGGTATGGTGTGGGTGGCATCAGCCATTGTGGACAAGAACAAGGCCAACAGAGAGTCACTCT 

ATGACTTTTGGGAGTACTATCTCCCCTACCTCTACTCATGCATCTCCTTCCTTGGGGTTCTG 

CTGCTCCTGGTGTGTACTCCACTGGGTCTCGCCCGCATGTTCTCCGTCACTGGGAAGCTGCT 

AGTCAAGCCCCGGCTGCTGGAAGACCTGGAGGAGCAGCTGTACTGCTCAGCCTTTGAGGAGG 

CAGCCCTGACCCGCAGGATCTGTAATCCTACTTCCTGCTGGCTGCCTTTAGACATGGAGCTG 

CTACACAGACAGGTCCTGGCTCTGCAGACACAGAGGGTCCTGCTGGAGAAGAGGCGGAAGGC 

TTCAGCCTGGCAACGGAACCTGGGCTACCCCCTGGCTATGCTGTGCTTGCTGGTGCTGACGG 

GCCTGTCTGTGCTCATTGTGGCCATCCACATCCTGGAGCTGCTCATCGATGAGGCTGCCATG 

CCCCGAGGCATGCAGGGTACCTCCTTAGGCCAGGTCTCCTTCTCCAAGCTGGGCTCCTTTGG 

TGCCGTCATTCAGGTTGTACTCATCTTTTACCTAATGGTGTCCTCAGTTGTGGGCTTCTATA 

GCTCTCCACTCTTCCGGAGCCTGCGGCCCAGATGGCACGACACTGCCATGACGCAGATAATT 

GGGAACTGTGTCTGTCTCCTGGTCCTAAGCTCAGCACTTCCTGTCTTCTCTCGAACCCTGGG 

GCTCACTCGCTTTGACCTGCTGGGTGACTTTGGACGCTTCAACTGGCTGGGCAATTTCTACA 

TTGTGTTCCTCTACAACGCAGCCTTTGCAGGCCTCACCACACTCTGTCTGGTGAAGACCTTC 

ACTGCAGCTGTGCGGGCAGAGCTGATCCGGGCCTTTGGGCTGGACAGACTGCCGCTGCCCGT 

CTCCGGTTTCCCCCAGGCATCTAGGAAGACCCAGCACCAGTGACCTCCAGCTGGGGGTGGGA 

AGGAAAAAACTGGACACTGCCATCTGCTGCCTAGGCCTGGAGGGAAGCCCAAGGCTACTTGG 

ACCTCAGGACCTGGAATCTGAGAGGGTGGGTGGCAGAGGGGAGCAGAGCCATCTGCACTATT 

GCATAATCTGAGCCAGAGTTTGGGACCAGGACCTCCTGCTTTTCCATACTTAACTGTGGCCT 

CAGCATGGGGTAGGGCTGGGTGACTGGGTCTAGCCCCTGATCCCAAATCTGTTTACACATCA 

ATCTGCCTCACTGCTGTTCTGGGCCATCCCCATAGCCATGTTTACATGATTTGATGTGCAAT 

AGGGTGGGGTAGGGGCAGGGAAAGGACTGGGCCAGGGCAGGCTCGGGAGATAGATTGTCTCC 

CTTGCCTCTGGCCCAGCAGAGCCTAAGCACTGTGCTATCCTGGAGGGGCTTTGGACCACCTG 

AAAGACCAAGGGGATAGGGAGGAGGAGGCTTCAGCCATCAGCAATAAAGTTGATCCCAGGGA 
AAAAAA 
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MEAPDYEVLSVREQLFHERIRECIISTLLFATLYILCHIFLTRFKKPAEPTTVDDEDATVNK 
lALELCTFTLAIALGAVLLLPFSIISNEVLLSLPRNYYIQWLNGSLIHGLWNLVFLFPNLSL 
IFI^PFAYFFTESEGFAGSRKGVLGRVYETVVMLMLLTLLVLCSNrVWVASAIVDKNKANRESL 
YDFWEYYLPYLYSCISFLGVLLLLVCTPLGLARMFSVTGKLLVKPRLLEDLEEQLYCSAFEE 
AALTRRICNPTSCm^PLDMELLHRQVlAI^TQRVLLEKRRKASAWQRNLGYPLAMLCLLVLT 
GLSVLIVAIHILELLIDEAAMPRGMQGTSLGQVSFSKLGSFGAVIQWLIFYLMVSSWGFY 
SSPLFRSLRPRWHDTAMTQIIGNCVCLLVLSSALPVFSRTLGLTRFDLLGDFGRFNWLGNFY 
IVFLYNAAFAGLTTLCLVKTFTAAVRAELIRAFGLDRLPLPVSGFPQASRKTQHQ 
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FIGURE 80 

GGCTGCCGAGGGAAGGCCCCTTGGGTTGGTCTTGGTTGCTTGGCGGCGGCGGNTTCNTCCCC 
GCTCGTCCTCCCCGGGCCCAGAGGCACCTCGGCTTCAGTCATGCTGAGCAGAGTATGGAAGC 
ACCTGACTACGAAGTGCTATCCGTGCGAGAACAGCTATTCCACGAGAGGATCCGCGAGTGTA 
TTATATCAACACTTCTGTTTGCAACACTGTACATCCTCTGCCACATCTTCCTGACCCGCTTC 
AAGAAGCCTGCTGAGTTCACCACAGTGGATGATGAAGATGCCACCG 



BNSOOCID: <WO_S9a30ea*2J.> 



wo 99/63088 



FIGURE 81 



PCT/US99/12252 



GACCGACCTTAAAGAGTGGGAGCAAAGGGAGGACAGAGCCTTTTAAAACGAGGCGGTGGTGC 

CTGCCCTTTAAGGGCGGGGCGTCCGGACGACTGTATCTGAGCCCCAGACTGCCCCGAGTTTC 

TGTCGCAGGCTGCGAGGAAAGGCCCCTAGGCTGGGTCTGGTGCTTGGCGGCGGCGGCTTCCT 

CCCCGTTGTCNTCCCCGGGCCCAGAGGCACCTCGGCTTCAGTCATGCTGAGCAGAGTATGGA 

AGCACCTGACTACGAAGTGCTATCCGTGCGAGAACAGCTATTCCACGAGAGGATCCGCGAGT 

GTATTATATCAACACTTCTGTTTGCAACACTGTACATCNTCTGCCACATCTTCCTGACCCGC 

TTCAAGAAGCCTGCTGAGTTCACCACAGTGGATGATGAAGATGCCACCGTCAACAAGATTGC 

GCTCGAGCTGTGCACCTTTACCCTGGCAATTGCCCTGGGTGCTGTCCTGCTCCTGCCCTTCT 
CCATCATCAGCAATGAGGTGCTGCACTCCC 
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GATGTGCTCCTTGGAGCTGGTGTGCAGTGTCCTGACTGTAAGATCAAGTCCAAACCTGTTTT 
GGAATTGAGGAAACTTCTCTTTTGATCTCAGCCCTTGGTGGTCCAGGTCTTCATGCTGCTGT 
GGGTGATATTACTGGTCCTGGCTCCTGTCAGTGGACAGTTTGCAAGGACACCCAGGCCCATT 

attttcctccagcctccatggaccacagtcttccaaggagagagagtgaccctcacttgcaa' 

GGGATTTCGCTTCTACTCACCACAGAAAACAAAATGGTACCATCGGTACCTTGGGAAAGAAA 

tactaagagaaaccccagacaatatccttgaggttcaggaatctggagagtacagatgccag 

GCCCAGGGCTCCCCTCTCAGTAGCCCTGTGCACTTGGATTTTTCTTCAGAGATGGGATTTCC 

tcatgctgcccaggctaatgttgaactcctgggctcaagtgatctgctcacctasgcctctc 

AAAGCGCTGGGATTACAGCTTCGCTGATCCTGCAAGCTCCACTTTCTGTGTTTGAAGGAGAC 

tctgtggttctgaggtgccgggcaaaggcggaagtaacactgaataatactatttacaagaa 

TGATAATGTCCTGGCATTCCTTAATAAAAGAACTGACTTCCAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 83 

MLLWVILLVLAPVSGQFARTPRPIIFLQPPWTTVFQGERVTLTCKGFRPYSPQKTKWYHRYL 
GKEILRETPDNILEVQESGEYRCQAQGSPLSSPVHLDFSSEMGFPHAAQANVELLGSSDLLT 
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FIGURE 84 

CAGAAGAGGGGGCTAGCTAGCTGTCTCTGCGGACCAGGGAGACCCCCGCGCCCCCCCGGTGT 

GAGGCGGCCTCACAGGGCCGGGTGGGCTGGCGAGCCGACGCGGCGGCGGAGGAGGCTGTGAG 

GAGTGTGTGGAACAGGACCCGGGACAGAGGAACC^GCTCCGCAGAACCTGAGCACCTTTT 

GCCTGTTGCTGCTATACCTCATCGGGGCGGTGATTGCCGGACGAGATTTCTATAAGATCTTG 

GGGGTGCCTCGAAGTGCCTCTATAAAGGATATTAAAAAGGCCTATAGGAAACTAGCCCTGCA 

GCTTCATCCCGACCGGAACCCTGATGATCCACAAGCCCAGGAGAAATTCCAGGATCTGGGTG 

CTGCTTATGAGGTTCTGTCAGATAGTGAGAAACGGAAACAGTACGATACTTATGGTGAAGAA 

GGATTAAAAGATGGTCATCAGAGCTCCCATGGAGACATTTTTTCACACTTCTTTGGGGATTT 

TGGTTTCATGTTTGGAGGAACCCCTCGTCAGCAAGACAGAAATATTCCAAGAGGAAGTGATA 

TTATTGTAGATCTAGAAGTCACTTTGGAAGAAGTATATGCAGGAAATTTTGTGGAAGTAGTT 

AGAAACAAACCTGTGGCAAGGCAGGCTCCTGGCAAACGGAAGTGCAATTGTCGGCAAGAGAT 

GCGGACCACCCAGCTGGGCCCTGGGCGCTTCCAAATGACCCAGGAGGTGGTCTGCGACGAAT 

GCCCTAATGTCAAACTAGTGAATGAAGAACGAACGCTGGAAGTAGAAATAGAGCCTGGGGTG 

AGAGACGGCATGGAGTACCCCTTTATTGGAGAAGGTGAGCCTCACGTGGATGGGGAGCCTGG 

AGATTTACGGTTCCGAATCAAAGTTGTCAAGCACCCAATATTTGAAAGGAGAGGAGATGATT 

TGTACACAAATGTGACAATCTCATTAGTTGAGTCACTGGTTGGCTTTGAGATGGATATTACT 

CACTTGGATGGTCACAAGGTACATATTTCCCGGGATAAGATCACCAGGCCAGGAGCGAAGCT 

ATGGAAGAAAGGGGAAGGGCTCCCCAACTTTGACAACAACAATATCAAGGGCTCTTTGATAA 

TCACTTTTGATGTGGATTTTCCAAAAGAACAGTTAACAGAGGAAGCGAGAGAAGGTATCAAA 

CAGCTACTGAAACAAGGGTCAGTGCAGAAGGTATACAATGGACTGCAAGGATATT^GAGTG 

AATAAAATTGGACTTTGTTTAAAATAAGTGAATAAGCGATATTTATTATCTGCAAGGTTTTT 

TTGTGTGTGTTTTTGTTTTTATTTTCAATATGCAAGTTAGGCTTAATTTTTTTATCTAATGA 

TCATCATGAAATGAATAAGAGGGCTTAAGAATTTGTCCATTTGCATTCGGAAAAGAATGACC 

AGCAAAAGGTTTACTAATACCTCTCCCTTTGGGGATTTAATGTCTGGTGCTGCCGCCTGAGT 

TTCAAGAATTAAAGCTGCAAGAGGACTCCAGGAGCAAAAGAAACACAATATAGAGGGTTGGA 

GTTGTTAGCAATTTCATTCAAAATGCCAACTGGAGAAGTCTGTTTTTAAATACATTTTGTTG 
TTATTTTTA 
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FIGURE Sj; 

MAPQNLSTFCLLLLYLIGAVIAGRDFYKILGVPRSASIKDIKKAYRKLALQLHPDRNPDDPQ 
AQEKFQDLGAAYEVLSDSEKRKQYDTYGEEGLKDGHQSSHGDIFSHFPGDFGFMFGGTPRQQ 
DRNIPRGSDIIVDLEVTLEEVYAGNFVEWRNKPVARQAPGKRKCNCRQEMRTTQLGPGRFQ 
MTQEWCDECPNVKLVNEERTLEVEIEPGVRDGMEYPFIGEGEPHVDGEPGDLRFRIKWKH 
PIFERRGDDLYTNVTISLVESLVGFEMDITHLDGHKVHISRDKITRPGAKLWKKGEGLPNFD 
NNNIKGSLIITFDVDFPKEQLTEEAREGIKQLLKQGSVQKVYNGLQGY 
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FIGURE S6 

TGGGACCAGGGAACCCCGGGCCCCCCGGTGGAGNGCCTAACAGGCCGGTGGNTGCGACCGAA 

GCGGCGGGCGGAGGAGGTTTTGAGGATTTTTGGAACAGGACCCGGACAGAGGAACCATGGTT 

CCGCAGAACNTGAGCACNTTTTGCCTGTTGNTGNTATACTTCATCGGGGCGGTGATTGCCGG 

ACGAGATTTNTATAAGATTTTGGGGTGCCTNGAAGTGCCTTNTATAAAGGATATTAAAAAGG 

CCTATAGGAAACTAGCCCTGCAGNTTTATCCCGACCGGAACCCTGATGATCCAC^^ 

GAGAAATTCCAGGATTTGGGTGCTGCTTATGAGGTTNTGTCAGATAGTGAGAAACGGAAACA 

GTACGATAATTATGGTGAAGAAGGATTAAAAGATGGTNATCAGAGCTCCCATGGAGACATTT 

TTTCACACTTNTTTGGGGATTTTGGTTTCATGTTTGGAGGAACCCCTNGTCAGCAAGAC^^^ 
AATATTCCAAGAG 
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GGCACGAGGCGGCGGGGCAGTCGCGGGATGCGCCCGGGAGCCACAGCCTGAGGCCCTCAGGT 

CTCTGCAGGTGTCGTGGAGGAACCTAGOlCCTGCCATCCrCTTCCCCAATTTGCCACTTCCA 

GCAGCTTTAGCCCATGAGGAGGATGTGACCGGGACTGAGTCAGGAGCCCTCTGGAAGCATGG 

AGACTGTGGTGATTGTTGCCATAGGTGTGCTGGCCACCATCTTTCTGGCTTCGTTTGCAGCC 

TTGGTGCTGGTTTGCAGGCAGCGCTACTGCCGGCCGCGAGACCTGCTGCAGCGCTATGATTC 

TAAGCCCATTGTGGACCTCATTGGTGCCATGGAGACCCAGTCTGAGCCCTCTGAGTTAGAAC 

TGGACGATGTCGTTATCACCAACCCCCACATTGAGGCCATTCTGGAGAATGAAGACTGGATC 

GAAGATGCCTCGGGTCTCATGTCCCACTGCATTGCCATCTTGAAGATTTGTCACACTCTGAC 

AGAGAAGCTTGTTGCCATGACAATGGGCTCTGGGGCCAAGATGAAGACTTCAGCCAGTGTCA 

GCGACATCATTGTGGTGGCCAAGCGGATCAGCCCCAGGGTGGATGATGTTGTGAAGTCGATG 

TACCCTCCGTTGGACCCCAAACTCCTGGACGCACGGACGACTGCCCTGCTCCTGTCTGTCAG 

TCACCTGGTGCTGGTGACAAGGAATGCCTGCCATCTGACGGGAGGCCTGGACTGGATTGACC 

AGTCTCTGTCGGCTGCTGAGGAGCATTTGGAAGTCCTTCGAGAAGCAGCCCTAGCTTCTGAG 

CCAGATAAAGGCCTCCCAGGCCCTGAAGGCTTCCTGCAGGAGCAGTCTGCAATT2AGTGCCT 

ACAGGCCAGCAGCTAGCCATGAAGGCCCCTGCCGCCATCCCTGGATGGCTCAGCTTAGCCTT 

CTACTTTTTCCTATAGAGTTAGTTGTTCTCCACGGCTGGAGAGTTCAGCTGTGTGTGCATAG 

TAAAGCAGGAGATCCCCGTCAGTTTATGCCTCTTTTGCAGTTGCAAACTGTGGCTGGTGAGT 

GGCAGTCTAATACTACAGTTAGGGGAGATGCCATTCACTCTCTGCAAGAGGAGTATTGAAAA 

CTGGTGGACTGTCAGCTTTATTTAGCTCACCTAGTGTTTTCAAGAAAATTGAGCCACCGTCT 

AAGAAATCAAGAGGTTTCACATTAAAATTAGAATTTCTGGCCTCTCTCGATCGGTCAGAATG 

TGTGGCAATTCTGATCTGCATTTTCAGAAGAGGACAATCAATTGAAACTAAGTAGGGGTTTC 

TTCTTTTGGCAAGACTTGTACTCTCTCACCTGGCCTGTTTCATTTATTTGTATTATCTGCCT 

GGTCCCTGAGGCGTCTGGGTCTCTCCTCTCCCTTGCAGGTTTGGGTTTGAAGCTGAGGAACT 

ACAAAGTTGATGATTTCTTTTTTATCTTTATGCCTGCAATTTTACCTAGCTACCACTAGGTG 

GATAGTAAATTTATACTTATGTTTCCCTCAAAAAAAAAAAAAAA 
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FIGURE 8« 

METWIVAIGVLATIFLASFAALVLVCRQRYCRPRDLLQRYDSKPIVDLIGAMETQSEPSEL 
ELDDWITNPHIEAILENEDWIEDASGLMSHCIAILKICHTLTEKLVAMTMGSGAKMKTSAS 
VSDIIWAKRISPRVDDWKSMYPPLDPKLLDARTTALLLSVSHLVLVTRNACHLTGGLDWI 
DQSLSAAEEHLEVLREAALASEPDKGLPGPEGFLQEQSAI 
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FIGURE 89 

GCTTCATTTCTCCCGACTCAGCTTCCCACCCTGGGCTTTCCGAGGTGCTTTCGCCGCTGTCC 

CCACCACTGCAGCCAISATCTCCTTAACGGACACGCAGAAAATTGGAATGGGATTAACAGGA 

TTTGGAGTGTTTTTCCTGTTCTTTGGAATGATTCTCTTTTTTGACAAAGCACTACTGGCTAT 

TGGAAATGTTTTATTTGTAGCCGGCTTGGCTTTTGTAATTGGTTTAGAAAGAACATTCAGAT 

TCTTCTTCCAAAAACATAAAATGAAAGCTACAGGTTTTTTTCTGGGTGGTGTATTTGTAGTC 

CTTATTGGTTGGCCTTTGATAGGCATGATCTTCGAAATTTATGGATTTTTTCTCTTGTTCAG 

GGGCTTCTTTCCTGTCGTTGTTGGCTTTATTAGAAGAGTGCCAGTCCrTG<^ 

TTACCTGGAATTAGATCATTTGTAGATAAAGTTGGAGAAAGCAACAATATGGTATAACAACA 

AGTGAATTTGAAGACTCATTTAAAATATTGTGTTATTTATAAAGTCATTTGAAGAATATTCA 

GCACAAAATTAAATTACATGAAATAGCTTGTAATGTTCTTTACAGGAGTTTAAAACGTATAG 

CCTACAAAGTACCAGCAGCAAATTAGCAAAGAAGCAGTGAAAACAGGCTTCTACTCAAGTGA 

ACTAAGAAGAAGTCAGCAAGCAAACTGAGAGAGGTGAAATCCATGTTAATGATGCTTAAGAA 

ACTCTTGAAGGCTATTTGTGTTGTTTTTCCACAATGTGCGAAACTCAGCCATCCTTAGAGAA 

CTGTGGTGCCTGTTTCTTTTCTTTTTATTTTGAAGGCTCAGGAGCATCCATAGGCATTTGCT 

TTTTAGAAGTGTCCACTGCAATGGCAAAAATATTTCCAGTTGCACTGTATCTCTGGAAGTGA 

TGCATGAATTCGATTGGATTGTGTCATTTTAAAGTATTAAAACCAAGGAAACCCCAATTTTG 

ATGTATGGATTACTTTTTTTTGNGCNCAGGGCC 
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FIGURE 90 

MISLTDTQKIGMGLTGFGVFFLFFGMILFFDKALLAIGNVLFVAGLAFVIGLERTFRFFFQK 
HKMKATGFFLGGVFWLIGWPLIGMIFEIYGFFLLFRGFFPVWGFIRRVPVLGSLLNLPGI 
RSFVDKVGESNNMV 

Important features : 
Transmeoibrane doiaains: 

amino acids 12-30 (typell) , 33-52, 69-89 and 93-109 

N-myristoylation sites. 

amino acids 11-16, 51-56 and 116-121 

Aminoacyl- transfer RNA synthetases class -II protein. 

amino acids 4 9-59 
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GAAGACGTGGCGGCTCTCGCCTGGGCTGTTTCCCGGCTTCATTTCTCCCGACTCAGCTTCCC 

ACCNTGGGCTTTCCGAGGTGCTTTCGCCGCTGTCCCCACCACTGCAGCCATGATCTCCTTAA 

CGGACACGCAGAAAATTGGAATGGGATTAACCGGATTTGGAGTGTTTTTCCTGTTCTTTGGA 

ATGATTCTCTTTTTTGACAAAGCACTACTGGCTATTGGAAATGTTTTATTTGTAGCCGGCTT 

GGCTTTTGTAATTGGTTTAGAAAGAACATTCAGATTCTTCTTCCAAAAACATAAAATGAAAG 

CTACAGGTTTTTTTCTGGGTGGTGTATTTGTAGTCCTTATTGGTTGGCCTTTGATAGGCATG 
ATCTTCGAAATTTATGGATTTTTTCTCTTGTTC 
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GGCACGAGGCTGAACCCAGCCGGCTCCATCTCAGCTTCTGGTTTCTAAGTCCATGTGCCAAA 

GGCTGCCAGGAAGGAGACGCCTTCCTGAGTCCTGGATCTTTCTTCCTTCTGGAAATCTTTGA 

CTGTGGGTAGTTATTTATTTCTGAATAAGAGCGTCCACGCATCATGGACCTCGCGGGACTGC 

TGAAGTCTCAGTTCCTGTGCCACCTGGTCTTCTGCTACGTCTTTATTGCCTCAGGGCTAATC 

ATO^O^CCATTCAGCTCTTCACTCTCCTCCrCTGGCCCATTAAa^AGCAGCTCTTCC^ 

GATCAACTGCAGACTGTCCTATTGCATCTCAAGCCAGCTGGTGATGCTGCTGGAGTGGTGGT 

CGGGCACGGAATGCACCATCTTCACGGACCCGCGCGCCTACCTCAAGTATGGGAAGGAAAAT 

GCCATCGTGGTTCTCAACCACAAGTTTGAAATTGACTTTCTGTGTGGCTGGAGCCTGTCCGA 

ACGCTTTGGGCTGTTAGGGGGCTCCAAGGTCCTGGCCAAGAAAGAGCTGGCCTATGTCCCAA 

TTATCGGCTGGATGTGGTACTTCACCGAGATGGTCTTCTGTTCGCGCAAGTGGGAGCAGGAT 

CGCAAGACGGTTGCCACCAGTTTGCAGCACCTCCGGGACTACCCCGAGAAGTATTTTTTCCT 

GATTCACTGTGAGGGCACACGGTTCACGGAGAAGAAGCATGAGATCAGCATGCAGGTGGCCC 

GGGCCAAGGGGCTGCCTCGCCTCAAGCATCACCTGTTGCCACGAACCAAGGGCTTCGCCATC 

ACCGTGAGGAGCTTGAGAAATGTAGTTTCAGCTGTATATGACTGTACACTCAATTTCAGAAA 

TAATGAAAATCCAACACTGCTGGGAGTCCTAAACGGAAAGAAATACCATGCAGATTTGTATG 

TTAGGAGGATCCCACTGGAAGACATCCCTGAAGACGATGACGAGTGCTCGGCCTGGCTGCAC 

AAGCTCTACCAGGAGAAGGATGCCTTTCAGGAGGAGTACTACAGGACGGGCACCTTCCCAGA 

GACGCCCATGGTGCCCCCCCGGCGGCCCTGGACCCTCGTGAACTGGCTGTTTTGGGCCTCGC 

TGGTGCTCTACCCTTTCTTCCAGTTCCTGGTCAGCATGATCAGGAGCGGGTCTTCCCTGACG 

CTGGCCAGCTTCATCCTCGTCTTCTTTGTGGCCTCCGTGGGAGTTCGATGGATGATTGGTGT 

GACGGAAATTGACAAGGGCTCTGCCTACGGCAACTCTGACAGCAAGCAGAAACTGAATGACI 

GACTCAGGGAGGTGTCACCATCCGAAGGGAACCTTGGGGAACTGGTGGCCTCTGCATATCCT 

CCTTAGTGGGACACGGTGACAAAGGCTGGGTGAGCCCCTGCTGGGCACGGCGGAAGTCACGA 

CCTCTCCAGCCAGGGAGTCTGGTCTCAAGGCCGGATGGGGAGGAAGATGTTTTGTAATCTTT 

TTTTCCCCATGTGCTTTAGTGGGCTTTGGTTTTCTTTTTGTGCGAGTGTGTGTGAGAATGGC 

TGTGTGGTGAGTGTGAACTTTGTTCTGTGATCATAGAAAGGGTATTTTAGGCTGCAGGGGAG 

GGCAGGGCTGGGGACCGAAGGGGACAAGTTCCCCTTTCATCCTTTGGTGCTGAGTTTTCTGT 

AACCCTTGGTTGCCAGAGATAAAGTGAAAAGTGCTTTAGGTGAGATGACTAAATTATGCCTC 

CAAGAAAAAAAAATTAAAGTGCTTTTCTGGGTCAAAAAAAAAAAA 
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MDLAGLLKSQFLCHLVFCYVFIASGLIINTIQLFTLLLWPINKQLFRKINCRLSYCISSQLV 

MLLEWWSGTECTIFTDPRAYLKYGKENAIVVUJHKFEIDFLCGWSLSERFGLLGGSKVLAKK 

ELAYVPIIGWMWYFTEMVFCSRKWEQDRKTVATSLQHLRDYPEKYFFLIHCEGTRFTEKKHE 

ISMQVARAKGLPRLKHHLLPRTKGFAITVRSLRNWSAVYDCTLNFRNNENPTLLGVLNGKK 

YHADLYVRRI PLEDI PEDDDECSAWLHKLYQEKDAFQEEYYRTGTFPETPMVPPRRPWTLVN 

WLFWASLVLYPFFQFLVSMIRSGSSLTLASFILVFFVASVGVRWMIGVTEIDKGSAYGNSDS 
KQPQiND 
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FIGURE 94 

CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCG 

GCGCACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAA 

GTAAAAGGTGAAGCCAAGAACAGCATTACTGATTCCCAAATGGATGATGTTGAAGTTGTTTA 

TACAATTGACATTCAGAAATATATTCCATGCTATCAGCTTTTTAGCTTTTATAATTCTTCAG 

GCGAAGTAAATGAGCAAGCACTGAAGAAAATATTATCAAATGTCAAAAAGAATGTGGTAGGT 

TGGTACAAATTCCGTCGTCATTCAGATCAGATCATGACGTTTAGAGAGAGGCTGCTTCACAA 

AAACTTGCAGGAGCATTTTTCAAACCAAGACCTTGTTTTTCTGCTATTAACACCAAGTATAA 

TAACAGAAAGCTGCTCTACTCATCGACTGGAACATTCCTTATATAAACCTCAAAAAGGACTT 

TTTCACAGGGTACCTTTAGTGGTTGCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAAC 

TGTATCAGGTTCCTGTATGTCCACTGGTTTTAGCCGAGCAGTACAAACACACAGCTCTAAAT 

TTTTTGAAGAAGATGGATCCTTAAAGGAGGTACATAAGATAAATGAAATGTATGCTTCATTA 

CAAGAGGAATTAAAGAGTATATGCAAAAAAGTGGAAGACAGTGAACAAGCAGTAGATAAACT 

AGTAAAGGATGTAAACAGATTAAAACGAGAAATTGAGAAAAGGAGAGGAGCACAGATTCAGG 

CAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGAACATTTTTCTTTGTCAGGCATTA 

CGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTTATGTCTTTAAAAAATAGACA 

TGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGTAGACAATCTGACCTTAA 

TGGTAGAACACACTGACATTCCTGAAGCTAGTCCAGCTAGTACACCACAAATCATTAAGCAT 

AAAGCCTTAGACTTAGATGACAGATGGCAATTCAAGAGATCTCGGTTGTTAGATACACAAGA 

CAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAATGAGCAGCC 

CAGAAACAGATGAAGAAATTGAAAAGATGAAGGGTTTTGGTGAATATTCACGGTCTCCTACA 

TTTTG&TCCTTTTAACCTTACAAGGAGATTTTTTTATTTGGCTGATGGGTAAAGCCAAACAT 

TTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCAC 

CTGTTTGCAGTAATACACAGATAACTCTTAGTGCATTTACTTCACAAAGTACTTTTTCAAAC 

ATCAGATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCT 

TACACAGACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAA. 

TCCCAGCACTTAGGGAAGACAAGTCAGGAGGATTGATTGAAGCTAGGAGTTAGAGACCAGCC 

TGGGCAACGTATTGAGACCATGTCTATTAAAAAATAAAATGGAAAAGCAAGAATAGCCTTAT 

TTTCAAAATATGGAAAGAAATTTATATGAAAATTTATCTGAGTCATTAAAATTCTCCTTAAG 

TGATACTTTTTTAGAAGTACATTATGGCTAGAGTTGCCAGATAAAATGCTGGATATCATGCA 

ATAAATTTGCAAAACATCATCTAAAATTTAAAAAAAAAAAAAAAAAAAAAA 
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MEGESTSAVLSGFVIX3A1APQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEVVYTIDIQ 
KYIPCYQLFSFYNSSGEVNEQALKKILSNVKKNWGWYKFRRHSDQIMTFRERLLHKNLQEH 
FSNQDLVFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSC 
MSTGFSRAVQTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVN 
RLKREIEKRRGAQIQAAREKNIQKDPQENIPLCQALRTFFPNSEFLHSCVMSLKNRHVSKSS 
CNYNHHLDWDNLTLMVEHTDI PEASPASTPQI I KHKALDLDDRWQFKRSRLLDTQDKRSKA 
NTGSSNQDKASKMSSPETDEEIEKMKGFGEYSRSPTF 



wo 99/63088 



5 3 A 



vo 



PCTAJS99/12252 



FIGURE 96 

GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCA 

GCGCAGGGCAGCCCAAGCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCC 

CCGCGGCGCCTGCCACCCTTCCCTCCTTCCCCGCGTCCCCGCCTCGCCGGCCAGTCAGCTTG 

CCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCTGCTTCCCTGGGCCG 

CGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCT 

GACGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCA 

ACTCCAACTCCTTCTCCCTCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCC 

GCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAAGGTGGGAACGCGTCCGCCCCGGCCCGCA 

CCATSGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTCAGCGCCGCGCTG 

CTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGG 

CTTCAACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCC 

AGGGTTCTACCTGCTGCTCTCAAGAGATGGAGGAGAAGTACAGCCTGCAAAGTAAAGATGAT 

TTCAAAAGTGTGGTCAGCGAACAGTGCAATCATTTGCAAGCTGTCTTTGCTTCACGTTACAA 

GAAGTTTGATGAATTCTTCAAAGAACTACTTGAAAATGCAGAGAAATCCCTGAATGATATGT 

TTGTGAAGACATATGGCCATTTATACATGCAAAATTCTGAGCTATTTAAAGATCTCTTCGTA 

GAGTTGAAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTG 

GGCTCGCCTCCTGGAGCGGATGTTCCGCCTGGTGAACTCCCAGTACCACTTTACAGATGAGT 

ATCTGGAATGTGTGAGCAAGTATACGGAGCAGCTGAAGCCCTTCGGAGATGTCCCTCGCAAA 

TTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGCTTAGCGGT 

TGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCC 

TGTTGAAGATGATCTACTGCTCCCACTGCCGGGGTCTCGTGACTGTGAAGCCATGTTACAAC 

TACTGCTCAAACATCATGAGAGGCTGTTTGGCCAACCAAGGGGATCTCGATTTTGAATGGAA 

CAATTTCATAGATGCTATGCTGATGGTGGCAGAGAGGCTAGAGGGTCCTTTCAACATTGAAT 

CGGTCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGATAATAGT 

GTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACG 

AATTTCTCGTTCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGG 

AACGCCCAACCACAGCAGCTGGCACTAGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAA 

CTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAACGTTTGCAACGATGAGAGGAT 

GGCTGCAGGAAACGGCAATGAGGATGACTGTTGGAATGGGAAAGGCAAAAGCAGGTACCTGT 

TTGCAGTGACAGGAAATGGATTAGCCAACCAGGGCAACAACCCAGAGGTCCAGGTTGACACC 

AGCAAACCAGACATACTGATCCTTCGTCAAATCATGGCTCTTCGAGTGATGACCAGCAAGAT 

GAAGAATGCATACAATGGGAACGACGTGGACTTCTTTGATATCAGTGATGAAAGTAGTGGAG 

AAGGAAGTGGAAGTGGCTGTGAGTATCAGCAGTGCCCTTCAGAGTTTGACTACAATGCCACT 

GACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCACA 

GGCCTACCTCCTCACTGTCTTCTGCATCTTGTTCCTGGTTATGCAGAGAGAGTGGAGATAAT 

TCTCAAACTCTGAGAAAAAGTGTTCATCAAAAAGTTAAAAGGCACCAGTTATCACTTTTCTA 

CCATCCTAGTGACTTTGCTTTTTAAATGAATGGACAACAATGTACAGTTTTTACTATGTGGC 

CACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGGAGGAAAAGGGACTGTG 

CATTGAGTTGGTTCCTGCTCCCCCAAACCATGTTAAACGTGGCTAACAGTGTAGGTACAGAA 

CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTC 

ATTTCGTTTGTGGGTTTTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAG 

GGTCCCTTCTTGGCACGTAACATGTACGTATTTCTGAAATATTAAATAGCTGTACAGAAGCA 

GGTTTTATTTATCATGTTATCTTATTAAAAGAAAAAGCCCAAAAAGC — " 
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MARFGLPALLCTIAVLSAALIJ^LKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQ 

GSTCCSQEMEEKYSLQSKDDFKSVVSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMF 

VKTYGHLYMQNSELFKDLFVELKRYYVVGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEY 

LECVSKYTEQLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVSVVNPTAQCTHAL 

LKMIYCSHCRGLVTVKPCYNYCSNIMRGCLANQGDLDFEWNNFIDAMLMVAE^ 

VMDPIDVKISDAIMNMQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEE 

RPTTAAGTSLDRLVTDVKEKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLF 

AVTGNGIJUsTQGNNPEVQVDTSKPDILILRQIMALRVMTSKMKNAYNGNDVDFFDIS 

GSGSGCEYQQCPSEFDYNATDHAGKSANEKADSAGVRPGAQAYLLTVFCILFLVMQREWR 
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FIGURE 98 

CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCT 

GACCTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAA 

GCTACTTACAGCTGCACCGACTVGTTGCGATGAAAGTTCTAATCTCTTCCCT 

TGCCACTAATGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCAC 

AGGGACCGAGGCCAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAA 

AGATTGGTTCCTGAGAGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGC 

AGTGCCCCTGTGATCATTTCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGA 

AAGCCAAACAAGCATTCCAGAGCCTGCCAGCAATTTCTCAT^CAATGTCAGCTAAGAAGCTT 

TGCTCTGCCTTTGTAGGAGCTCTGAGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAA 

GACAGTGAGCACACCTACCAGACACTCTTCTTCTCCCACCTCACTCTCCCACTGTACCCACC 

CCTAAATCATTCCAGTGCTCTCAAAAAGCATGTTTTTCAAGATCATTTTGTTTGTTGCTCTC 

TCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCTGTGCCCTCCCCTTACCCAGGCTTAGGCTT 

AATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCTAGCTAGTGTCATTTAACCTTAAATGC 

AATCAGGAAAGTAGCAAACAGAAGTCAATAAATATTTTTAT^TGTCAAT^AAAAAT 
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FIGURE 99 

MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRR 
KFMTVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 
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A ATG GCTGTCTTAGTACTTCGCCTGACAGTTGTCCTGGGACTGCTTGTCTTATTCCTGACCT 
GCTATGCAGACGACAAACCAGACAAGCCAGACGACAAGCCAGACGACTCGGGCAAAGACCCA 
AAGCCAGACTTCCCCAAATTCCTAAGCCTCCTGGGCACAGAGATCATTGAGAATGCAGTCGA 
GTTCATCCTCCGCTCCATGTCCAGGAGCACAGGATTTATGGAATTTGATGATAATGAAGGAA 
AACATTCATCAAAGTGACATCCTCAGGACACACCCATGTGGCTCCTGGACAATCCAAGAGCA 
GCCAAATCCTGCTTTTCCAGTTTGGCTCCACAAGTCCTCCAGGACAGAGCCCTCAAAGCAAC 
TCCCAACGAGTTCTCAGGATTCAGGCTCTGGCTTCAACCAAACAGAACTCATTTTGAACACC 
CTGACTGCATTTTTGCTTTTAGAAAGTTAGAATAAATATGGCGCTTTGGGATCACATAGTTG 
ATGGAGAGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 101 

r^VLVLRLTVVLGLLVLFLTCYADDKPDKPDDKPDDSGKDPKPDFPKFLSLLGTE 1 1 ENAVE 
FILRSMSRSTGFMEFDDNEGKHSSK 
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FIGURE 102 

GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCT 
CAGAGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTAC 
CCTGCCCCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCC 
CCTACCTGATGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAG 
CTCTTCAGCCAGATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGG 
CTGCGGAACCGGAGCCAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACC 
CAAATCCCCACTTTGAGAAGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATAT 
GAGCGGTTTGTGGTGGCTCCTGGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGT 
GGTGGTCTGCACTCTGGTGCTGTGCTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCC 
GGAGAGTACTGAGACCGGGAGGTGTGCTCTTTTTCTGGGAGCATGTGGCAGAACCATATGGA 
AGCTGGGCCTTCATGTGGCAGCAAGTTTTCGAGCCCACCTGGAAACACATTGGGGATGGCTG 
CTGCCTCACCAGAGAGACCTGGAAGGATCTTGAGAACGCCCAGTTCTCCGAAATCCAAATGG 
AACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTGGGCCCCACATCATGGGAAAGGCTGTC 
AAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCCTTCCCCAGCCTCCAATTAGAACA 
AGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTAGCAGAATGAGAGAAGACATT 
CATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGCAATCTCTAACTTCAATC 
CCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGGAAACACTAGGACCC 
TGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTCCCAATGTTGTC 
CCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACACCCATGCGT 
CTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGACCCTCT 
CTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGGAT 
GCCAGAGCAAGACTCAAAGAGGCAGAGGTTTTGTTCTCAAATATTTTTTAATAAATAGACGA 
AACCACG 
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FIGURE 103 

IVDILVPLLQLLVLLLTLPLHIJ^LGCWQPLCKSYFPYIJ^AVLTPKS^ 
KGLTGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFVV 
APGEDMRQLADGSMDVWCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHV^ 
WQQVFEPTWKHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFP 
SSKALI CSFPSLQLEQATHQPIYLPLRGT 
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FIGURE 104 

GTGGGATTTATTTGAGTGCAAGATCGTTTTCTCAGTGGTGGTGGAAGTTGCCTCATCGCAGG 
CAGATGTTGGGGCTTTGTCCGAACAGCTCCCCTCTGCCAGCTTCTGTAGATAAGGGTTAAAA 
ACTAATATTTATATGACAGAAGAAAAAGATGTCATTCCGTAAAGTAAACATCATCATCTTGG 
TCCTGGCTGTTGCTCTCTTCTTACTGGTTTTGCACCATAACTTCCTCAGCTTGAGCAGTTTG 
TTAAGGAATGAGGTTACAGATTCAGGAATTGTAGGGCCTCAACCTATAGACTTTGTCCCAAA 
TGCTCTCCGACATGCAGTAGATGGGAGACAAGAGGAGATTCCTGTGGTCATCGCTGCATCTG 
AAGACAGGCTTGGGGGGGCCATTGCAGCTATAAACAGCATTCAGCACAACACTCGCTCCAAT 
GTGATTTTCTACATTGTTACTCTCAACAATACAGCAGACCATCTCCGGTCCTGGCTCAACAG 
TGATTCCCTGAAAAGCATCAGATACAAAATTGTCAATTTTGACCCTAAACTTTTGGAAGGAA 
AAGTAAAGGAGGATCCTGACCAGGGGGAATCCATGAAACCTTTAACCTTTGCAAGGTTCTAC 
TTGCCAATTCTGGTTCCCAGCGCAAAGAAGGCCATATACATGGATGATGATGTAATTGTGCA 
AGGTGATATTCTTGCCCTTTACAATACAGCACTGAAGCCAGGACATGCAGCTGCATTTTCAG 
AAGATTGTGATTCAGCCTCTACTAAAGTTGTCATCCGTGGAGCAGGAAACCAGTACAATTAC 
ATTGGCTATCTTGACTATAAAAAGGAT^GAATTCGTAAGCTTTCCATGAAAGCCAGCACTTG 
CTCATTTAATCCTGGAGTTTTTGTTGCAAACCTGACGGAATGGAAACGACAGAATATAACTA 
ACCAACTGGAAAAATGGATGAAACTCAATGTAGAAGAGGGACTGTATAGCAGAACCCTGGCT 
GGTAGCATCACAACACCTCCTCTGCTTATCGTATTTTATCAACAGCACTCTACCATCGATCC 
TATGTGGAATGTCCGCCACCTTGGTTCCAGTGCTGGAAAACGATATTCACCTCAGTTTGTAA 
AGGCTGCCAAGTTACTCCATTGGAATGGACATTTGAAGCCATGGGGAAGGACTGCTTCATAT 
ACTGATGTTTGGGAAAAATGGTATATTCCAGACCCAACAGGCAAATTCAACCTAATCCGAAG 
ATATACCGAGATCTCAAACATAAAGTGAAACAGAATTTGAACTGTAAGCAAGCATTTCTCAG 
GAAGTCCTGGAAGATAGCATGCATGGGAAGTAACAGTTGCTAGGCTTCAATGCCTATCGGTA 
GCAAGCCATGGAAAAAGATGTGTCAGCTAGGTAAAGATGACAAACTGCCCTGTCTGGCAGTC 
AGCTTCCCAGACAGACTATAGACTATAAATATGTCTCCATCTGCCTTACCAAGTGTTTTCTT 
ACTACAATGCTGAATGACTGGAAAGAAGAACTGATATGGCTAGTTCAGCTAGCTGGTACAGA 
TAATTCAAAACTGCTGTTGGTTTTT^TTTTGTAACCTGTGGCCTGATCTGTAAATAAAACTT 
ACATTTTTC 
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FIGURE 105 



MSFRKVNI I ILVLAVALFLLVLHHNFLSLSSLLRNEVTDSGI VGPQPIDFVPNALRHAVDGR 
QEEIPWIAASEDRLGGAIAAINSIQHNTRSNVIFYIVTLNNTADHLRSWLNSDSLKSIRYK 
IVNFDPKLLEGKVKEDPDQGESMKPLTFARFYLPILVPSAKKAIYMDDDVIVQGDILALYNT 
ALKPGHAAAFSEDCDSASTKWIRGAGNQYNYIGYLDYKKERIRKLSMKASTCSFNPGVFVA 
NLTEWKRQNITNQLEKWMKLNVEEGLYSRTLAGSITTPPLLIVFYQQHSTIDPMWNVRHLGS 
SAGKRYSPQFVKAAKLLHWNGHLKPWGRTASYTDVWEKWYIPDPTGKFNLIRRYTEISNIK 
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FIGURE 106 

TGGTTTTTGCCCCATAAATTCCCTCAGCTTGAGCAGTTTGTTAAGGAATGAGGTTACAGATT 

CAGGAATTOTAGGNCCTCAACCTNTAGANTTTGTCCCAAATGTTCrCCGACATGC^ 

GGGAGACAAGAGGAGATTCCTGTGGTCATCGCTGCATNTGAAGACAGGCTTGGGGGGGCCAT 

TGCAGCTATAAACAGCATTCAGCACAACACTCGNTCCAATGTGATTTTCTACATTGTTACT 

TCAACAATACAGCAGACCATNTCCGGTCCTGGNTCAACAGTGATTCCCTGAAAAGCATCAGA 

TACAAAATTGTCAATTTTGACCCTAAACTTTTGGAAGGAAAAGTAAAGGAGGATCCTGACCA 

GGGGGAATCCATGAAACCTTTAACCTTTGCAAGGTTCTACTTGCCAATTCTGGTTCCCAGCG 

CAAAGAAGGCCATATACATGGATGATGATGTAATTGTGCAAGGTGATATTCTTGCCCTTTAC 

AATACAGCACTGAAGCCAGGACATGCAGCTGCATTTTCAGAAGATTGTGATTCAGCCTCTAC 

TAAAGTTGTCATCCGTGGAGCAGGAAA 
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FIGURE 107 

CGACGCTCTAGCGGTTACCGCTGCGGGCTGGCTGGGCGTAGTGGGGCTGCGCGGCTGCCACG 

GAGCTAGAGGGCAAGTGTGCTCGGCCCAGCGTGCAGGGAACGCGGGCGGCCAGACAACGGGC 

TGGGCTCCGGGGCCTGCGGCGCGGGCGCTGAGCTGGCAGGGCGGGTCGGGGCGCGGGCTGCA 

TCCGCATCrCCTCCATCGCCTGCAGTAAGGGCGGCCGCGGCGAGCCTTTGAGGGGAAC^ 

TGTCGGAGCCCTAACCAGGGGTGTCTCTGAGCCTGGTGGGATCCCCGGAGCGTCACATCACT 

TTCCGATCACTTCAAAGTGGTTAAAAACTAATATTTATATGACAGAAGAAAAAGATGTCATT 

CCGTAAAGTAAACATCATCATCTTGGTCCTGGGCTGTTGCTCTCTTCTTACTGGTTTTGCAC 

CATAACTTCCTCAGCTTGAGGCAGTTTGTTAAGGAATGAGGTTACAGATTCAGGAATTGTAG 

GGCCTCAACCTATAGGACTTTGTCCCAAATGCTCTCCGACATGCAGTAGATGGGAGACAAGA 

GGAGATTCCTGTGGTCATCGCTGCATCTGAAGACAGGCTTGGGGGGGCCATTGCAGCTATAA 

ACAGCATTCAGCACAACACTCGCTCCAATGTGATTTTCTACATTGTTACTCTCAACAATACA 

GCAGACCATCTCCGGTCCTGGGCTCAACAGTGATTCCCTG7VAAAGCATCAGATACAAAATTG 

TCAATTTTGACCCTAAACTTTTGGAAGGAAAAGTAAAGGAGGATCCTGACCAGGGGGAATCC 

ATGAAACCTTTAACCTTTGCAAGGTTCTACTTGCCAATTCTGGGTTCCCAGCGCAAAGAAGG 

CCATATACATGGATGATGATGTAATTGTGCAAGGTGATATTCTTGCCCTTTACAATACAGCA 

CTGAAGCCAGGACATGCAGCTGCATTTTCAGAAGATTGTGATTCAGCCTCTACTAAAGTTGT 

CATCCGTGGAGCAGGAAACCAGTACAATTACATTGGCTATCTTGACTATAAAAAGGAAAGAA 

TTCGTAAGCTTTCCATGAAAGCCAGCACTTGCTCATTTAATCCTGGAGTTTTTGTTGCAAAC 

CTGACGGAATGGAAACGACAGAATATAACTAACCAACTGGAAAAATGGATGAAACTCAATGT 

AGAAGAGGGACTGTATAGCAGAACCCTGGCTGGTAGCATCACAACACCTCCTCTGCTTATCG 

TATTTTATCAACAGCACTCTACCATCGATCCTATGTGGAATGTCCGCCACCTTGGTTCCAGT 

GCTGGAAAACGATATTCACCTCAGTTTGTAAAGGCTGCCAAGTTACTCCATTGGAATGGACA 

TTTGAAGCCATGGGGAAGGACTGCTTCATATACTGATGTTTGGGGAAAAATGGTATATTCCA 

GACCCAACAGGCAAATTCAACCTAATCCGAAGATATACCGAGATCTCAAACATAAAGTGAAA 

CAGAATTTGAACTGTAAGCAAGCATTTCTCAGGAAGTCCTGGAAGATAGCATGCGTGGGAAG 

TAACAGTTGCTAGGCTTCAATGCCTATCGGTAGCAAGCCATGGAAAAAGATGTGTCAGCTAG 

GTAAAGATGACAAACTGCCCTGTCTGGCAGTCAGCTTCCCAGACAGACTATAGACTATAAAT 

ATGTCTCCATCTGCCTTACCAAGTGTTTTCTTACTACAATGCTGAATGACTGGAAAGAAGAA 

CTGATATGGCTAGTTCAGCTAGCTGGTACAGATAATTCAAAACTGCTGTTGGTTTTAATTTT 

GTAACCTGTGGCCTGATCTGTAAATAAAACTTACATTTTTCAATAGGTAAAAAAAAAAAAAA 

AAAAAA 
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FTGTJRE 108 

CTGCAGGTAGACATCTCCACTGCCCAGGAATCACTGAGCGTGCAGACAGCACAGCCTCCTCT 

GAAGGCCGGCCATACCAGAGTCCTGCCTCGGCATGGGCCTCACCATTGAGGCAGCTCCACTG 

TCTGTGCTGGTCTGAGGGTGCTGCCTGTCATGGGGGCAGCCATCTCCCAGGGGGCCCTCATC 

GCCATCGTCTGCAACGGTCTCGTGGGCTTCTTGCTGCTGCTGCTCTGGGTCATCCTCTGCTG 

GGCCTGCCATTCTCGTCTGCCGACGTTGACTCTCTCTCTGAATCCAGTCCCAACTCCAGCCC 

TGGCCCCTGTCCTGAGAAGGCCCCACCACCCCAGAAGCCCAGCCATGAAGGCAGCTACCTGC 

TGCAGCCCTGAAGGCCCCTGGCCTAGCCTGGAGCCCAGGACCTAAGTCCACCTCACCTAGAG 

CCTGGAATTAGGATCCCAGAGTTCAGCCAGCCTGGGGTCCAGAACTCAAGAGTCCGCCTGCT 

TGGAGCTGGACCCAGCGGCCCAGAGTCTAGCCAGCTTGGCTCCAATAGGAGCTCAGTGGCCC 

TAAGGAGATGGGCCTGGGGTGGGGGCTTATGAGTTGGTGCTAGAGCCAGGGCCATCTGGACT 

ATGCTCCATCCCAAGGGCCAAGGGTCAGGGGCCGGGTCCACTCTTTCCCTAGGCTGAGCACC 

TCTAGGCCCTCTAGGTTGGGGAAGCAAACTGGAACCCATGGCAATAATAGGAGGGTGTCCAG 

GCTGGGCCCCTCCCCTGGTCCTCCCAGTGTTTGCTGGATAATAAATGGAACTATGGCTCTAA 

AAAAAAAAAAAAAAAAA 
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FIGURE 109 

MGAAISQGALIAIVCNGLVGFLLLLLWVILCWACHSRLPTLTLSLNPVPTPALAPVLRRPHH 
PRSPAMKAATCCSPEGPWPSLEPRT 



wo 99/63088 / PCTAJS99/122S2 

FIGURE 110 

GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCA 
GTTCCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTA 
CTCCCTATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAA 
TCATGTCGGGAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCC 
ATGATGTTTACCTTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTAT 
TTTGGGATTGTTGTTTGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACC 
TCAGCATAGAATTGGACACAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTA 
TCCACAGGCATCACGGCAGTGCTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATT 
GACAGTTGAGCTTTTCCAAATCACAAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCC 
AGCCACTGTGGACATTTGCCATCCTCATTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTG 
AGCCTGGGAACTGCAGGAGCTGCCCAGGTTATGGAAGGCGGCCAAGTGGAATATAAGCCCCT 
TTCGGGCATTCGGTACATGTGGTCGTACCATTTAATTGGCCTCATCTGGACTAGTGAATTCA 
TCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAGTGGTTACTTGTTATTTCAACAGAAGT 
AAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTCTCCATTCTCTTCTTCTACCATCA 
AGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAGGATTCCGAGAATCATTGTCA 
TGTACATGCAAAACGCACTGAAAGAACAGCAGCATGGTGCATTGTCCAGGTACCTGTTCCGA 
TGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTCAACCAGAATGCATA 
TACTACAACTGCTATTAATGGGACAGATTTCTGTACATCAGCAAAAGATGCATTCAAAATCT 
TGTCCAAGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCATAATTTTTCTA 
GGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTACAATCG 
GGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGCCC 
ATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGAT 
CTGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTT 
CGTAAAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGA 
ATGAGGAGGGAACAGAACTCCAGGCCATTGTGAGATAGATACCCATTTAGGTATCTGTACCT 
GGAAAACATTTCCTTCTAAGAGCCATTTACAGAATAGAAGATGAGACCACTAGAGAAAAGTT 
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FIGURE 111 

MSGRDT I LGLC I LALALSLAMMFTFRF I TTLLVH I F I S LV I LGLLFVCGVLWWL YYD YTNDL 

SIEU)TERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFL^ 

PLWTFAILI FFWVLWVAVLLSLGTAGAAQVMEGGQVEYKPLSGI RYMWSYHLIGLI WTSEFI 

IJVCQQMTIAGAVVTCYFNRSKNDPPDHPILSSLSILFFYHQGTVVKGSFLISVVRIPRIIVM 

YMQNALKEQQHGALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKIL 

SKNSSHFTSINCFGDFIIFLGKVLVVCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAH 

SFLSVFETVLDALFLCFAVDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRN 
EEGTELQAIVR 
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FIGURK 112 

tgtggtgaaaattttttgaaaaaaaaattgccttcttcaaaSg^^ 

TATGAGGACTGTTGTTCTCy^CTATGAAGGCATCTGTTATTGAAATGTTCC™ 
TGACTGGAGTACATTCAAACAAAGAAACGGCAAAGAAGmAAAA^ 

JS^SS^™J^'=^°^^^^^^TACCATGTTTATGGCACTGACGTGTA^^^^ 

™^S^°^^^°°^°^°<='^^TACACAGTGGTGTGCTTGATAATTCAGGAGGGA^ 

GTTCGGAAGGTTGCTGGACAGTCTGGTTACAAAGGGAGTTATTCciASGTGTCCAA 

atccctaccacgatggagagaatcctttatcgtcttagaaagtaSc? 

CCTACCCATCAGCTCTTACATACTCATCATCGAAAAGT^GClScCCA^CA^ 
ACAAAAGCCTATCAGAGGCCACCTATTCCAGGGACAACTGCACAG^^ 

gcttctggctgtcactgtagctgtggccacccccaccacctI^^^^ 

aatccagaagcagctcctggctgatgttgcccaagctcttgS?gg?^5g^^^ 
gggctcccaatgtggtggtggtgatggtggatggctggcSacgSSca^ 

TCAAGACTTGCGAGAGAGTCAGGAATCAACATTTTCTTciT^^^ 
^??TA???S??cI?G?g?S?f^^^^ 

SS???S??J?Sg'?^^^^^^^^^^ 

tggcttcgtcatcgacggctccagcagtgtggggacgggSttccg^^^ 

ttgtgaccaacctcaccaaagagtttgagatttccgacac^gJSosSsc^^^ 

cagtacacctacgaacagcggctggagtttgggttcgacaagtacagcagc^^ 

cctcaacgccatcaagagggtgggctactggagtggtggcaSagcacgg^ 

acttcgccctggagcagctcttcaagaagtccaagScaacSggSgt?S^ 

atcaccgacgggaggtcctacgacgacgtccggatcccagccatggctc???^^^ 

agtgatcacctatgcgataggcgttgcctgggctgcccaagaggSSSot 

ctcaccccgccagagaccactccttctttgtggacgagtttgSaaSt?^^ 

cccaggatcatccagaacatttgtacagagttcaactcacagcc^tcggSotgSt™ 

caggcagagcaccagcaagtgctgctttactaactgacgtgt^^^ 



rJ?JS?rJ?J'J'^^''^°^^^™^°^'rTTGAACTCTG?S?^^^ 



AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
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MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKC 

PAGCQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSL 

SLPRWRESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQ 

LLAVTVAVATPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQ 

RQDPSGAAFQKPVGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFR 

IQKQLLADVAQALDIGPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNV 

GRAISFVTKNFFSKANGNRSGAPNWWMVDGWPTDKVEEASRLARESGINIFFITIEGAAE 

NEKQYWEPNFANKAVCRTNGFYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADI 

GFVIDGSSSVGTGNFRTVLQFVTNLTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDI 

LNAIKRVGYWSGGTSTGAAINFALEQLFKKSKPNKRKLMILITDGRSYDDVRIPAMAAHLKG 

VITYAIGVAWAAQEELEVIATHPARDHSFFVDEFDNLHQYVPRIIQNICTEFNSQPRN 
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FIGURE 114 

CACX3ATGAACTGGTTGCAGTGGCTGCTGCTGCTGCGGGGGCGCTGAGAGGACACGAGCTCTA 

IGCCTTTCCGGCTGCTCATCCCGCTCGGCCTCCTGTGCGCGCTGCTGCCTCAGCACCATGGT 

GCGCCAGGTCCCGACGGCTCCGCGCCAGATCGCGCCCACTACAGTTTTTCTCTGACTCTAAT 

TGATGCACTGGACACCTTGCTGATTTTGGGGAATGTCTCAGAATTCCAAAGAGTGGTTGAAG 

TGCTCCAGGACAGCGTGGACTTTGATATTGATGTGAACGCCTCTGTGTTTGAAACAAACATT 

CGAGTGGTAGGAGGACTCCTGTCTGCTCATCTGCTCTCCAAGAAGGCTGGGGTGGAAGTAGA 

GGCTGGATGGCCCTGTTCCGGGCCTCTCCTGAGAATGGCTGAGGAGGCGGCCCGAAAACTCC 

TCCCAGCCTTTCAGACCCCCACTGGCATGCCATATGGAACAGTGAACTTACTTCATGGCGTG 

AACCCAGGAGAGACCCCTGTCACCTGTACGGCAGGGATTGGGACCTTCATTGTTGAATTTGC 

CACCCTGAGCAGCCTCACTGGTGACCCGGTGTTCGAAGATGTGGCCAGAGTGGCTTTGATGC 

GCCTCTGGGAGAGCCGGTCAGATATCGGGCTGGTCGGCAACCACATTGATGTGCTCACTGGC 

AAGTGGGTGGCCCAGGACGCAGGCATCGGGGCTGGCGTGGACTCCTACTTTGAGTACTTGGT 

GAAAGGAGCCATCCTGCTTCAGGATAAGAAGCTCATGGCCATGTTCCTAGAGTATAACAAAG 

CCATCCGGAACTACACCCGCTTCGATGACTGGTACCTGTGGGTTCAGATGTACAAGGGGACT 

GTGTCCATGCCAGTCTTCCAGTCCTTGGAGGCCTACTGGCCTGGTCTTCAGAGCCTCATTGG 

AGACATTGACAATGCCATGAGGACCTTCCTCAACTACTACACTGTATGGAAGCAGTTTGGGG 

GGCTCCCGGAATTCTACAACATTCCTCAGGGATACACAGTGGAGAAGCGAGAGGGCTACCCA 

CTTCGGCCAGAACTTATTGAAAGCGCAATGTACCTCTACCGTGCCACGGGGGATCCCACCCT 

CCTAGAACTCGGAAGAGATGCTGTGGAATCCATTGAAAAAATCAGCAAGGTGGAGTGCGGAT 

TTGCAACAATCAAAGATCTGCGAGACCACAAGCTGGACAACCGCATGGAGTCGTTCTTCCTG 

GCCGAGACTGTGAAATACCTCTACCTCCTGTTTGACCCAACCAACTTCATCCACAACAATGG 

GTCCACCTTCGACGCGGTGATCACCCCCTATGGGGAGTGCATCCTGGGGGCTGGGGGGTACA 

TCTTCAACACAGAAGCTCACCCCATCGACCTTGCCGCCCTGCACTGCTGCCAGAGGCTGAAG 

GAAGAGCAGTGGGAGGTGGAGGACTTGATGAGGGAATTCTACTCTCTCAAACGGAGCAGGTC 

GAAATTTCAGAAAAACACTGTTAGTTCGGGGCCATGGGAACCTCCAGCAAGGCCAGGAACAC 

TCTTCTCACCAGAAAACCATGACCAGGCAAGGGAGAGGAAGCCTGCCAAACAGAAGGTCCCA 

CTTCTCAGCTGCCCCAGTCAGCCCTTCACCTCCAAGTTGGCATTACTGGGACAGGTTTTCCT 

AGACTCCTCATAACCACTGGATAATTTTTTTATTTTTATTTTTTTGAGGCTAAACTATAATA 
AATTGCTTTTGGCTATCATAAAA 
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FIGURE 11^ 

MPFRLLIPLGLLCALLPQHHGAPGPDGSAPDPAHYSFSLTLIDALDTLLILGNVSEPQRWE 
VLQDSVDFDIDWASVFETNIRWGGLLSAHLLSKKAGVEVEAGWPCSGPLLRMAEEAARKL 
LPAFQTPTGMPYGTVNLLHGVNPGETPVTCTAGIGTFIVEFATLSSLTGDPVFEDVARVALM 
RLWESRSDIGLVGNHIDVLTGKWAQDAGIGAGVDSYFEYLVKGAILLQDKKLMAMFLEYNK 
AIRNYTRFDDWYLWVQMYKGTVSMPVFQSLEAYWPGLQSLIGDIDNAMRTFLNYYTVWKQFG 
GLPEFYNIPQGYTVEKREGYPLRPELIESAMYLYRATGDPTLLELGRDAVESIEKISKVECG 
FATIKDLRDHKLDNRMESFFLAETVKYLYLLFDPTNFIHNNGSTFDAVITPYGECILGAGGY 
IFNTEAHPIDLAALHCCQRLKEEQWEVEDLMREFYSLKRSRSKFQKNTVSSGPWEPPARPGT 
LFSPENHDQARERKPAKQKVPLLSCPSQPFTSKLALLGQVFLDSS 
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FIGURE 116 

AAAGTTACATTTTCTCTGGAACTCTCCTAGGCCACTCCCTGCTGATGCAACATCTGGGTTTG 

GGCAGAAAGGAGGGTGCTTCGGAGCCCGCCCTTTCTGAGCTTCCTGGGCCGGCTCTAGAACA 

ATTCAGGCTTCGCTGCGACTCAGACCTCAGCTCCAACATATGCATTCTGAAGAAAGATGGCT 

GAGATGGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGGTCAAACTGAGTCTACCA 

AATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGAOAGTCTTTTCATGTGGTTTTTCT 

• ACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGCCTGCCCCTCAGAACCTC 

TCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCAGTGATCGCGCCTGGAGA 

AACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCTGTACACGAGCCACATCT 

GGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTGATGTCACTGATGACATC 

ACGGCCACTGTGCCATACAACCTTCGTGTCAGGGCCACATTGGGCTCACAGACCTCAGCCTG 

GAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTACCCGACCTGGGATGGAGA 

TCACCAAAGATGGCTTCCACCTGGTTATTGAGCTGGAGGACCTGGGGCCCCAGTTTGAGTTC 

CTTGTGGCCTACTGGAGGAGGGAGCCTGGTGCCGAGGAACATGTCAAAATGGTGAGGAGTGG 

GGGTATTCCAGTGCACCTAGAAACCATGGAGCCAGGGGCTGCATACTGTGTGAAGGCCCAGA 

CATTCGTGAAGGCCATTGGGAGGTACAGCGCCTTCAGCCAGACAGAATGTGTGGAGGTGCAA 

GGAGAGGCCATTCCCCTGGTACTGGCCCTGTTTGCCTTTGTTGGCTTCATGCTGATCCTTGT 

GGTCGTGCCACTGTTCGTCTGGAAAATGGGCCGGCTGCTCCAGTACTCCTGTTGCCCCGTGG 

TGGTCCTCCCAGACACCTTGAAAATAACCAATTCACCCCAGAAGTTAATCAGCTGCAGAAGG 

GAGGAGGTGGATGCCTGTGCCACGGCTGTGATGTCTCCTGAGGAACTCCTCAGGGCCTGGAT 

CTCAiaSGTTTGCGGAAGGGCCCAGGTGAAGCCGAGAACCTGGTCTGCATGACATGGAAACC 

ATGAGGGGACAAGTTGTGTTTCTGTTTTCCGCCACGGACAAGGGATGAGAGAAGTAGGAAGA 

GCCTGTTGTCTACAAGTCTAGAAGCAACCATCAGAGGCAGGGTGGTTTGTCTAACAGAACAC 

TGACTGAGGCTTAGGGGATGTGACCTCTAGACTGGGGGCTGCCACTTGCTGGCTGAGCAACC 

CTGGGAAAAGTGACTTCATCCCTTCGGTCCTAAGTTTTCTCATCTGTAATGGGGGAATTACC 

TACACACCTGCTAAACACACACACACAGAGTCTCTCTCTATATATACACACGTACACATAAA 

TACACCCAGCACTTGCAAGGCTAGAGGGAAACTGGTGACACTCTACAGTCTGACTGATTCAG 

TGTTTCTGGAQAGCAGGACATAAATGTATGATGAGAATGATCAAGGACTCTACACACTGGGT 

GGCTTGGAGAGCCCACTTTCCCAGAATAATCCTTGAGAGAAAAGGAATCATGGGAGCAATGG 

TGTTGAGTTCACTTCAAGCCCAATGCCGGTGCAGAGGGGAATGGCTTAGCGAGCTCTACAGT 

AGGTGACCTGGAGGAAGGTCACAGCCACACTGAAAATGGGATGTGCATGAACACGGAGGATC 

CATGAACTACTGTAAAGTGTTGACAGTGTGTGCACACTGCAGACAGCAGGTGAAATGTATGT 

GTGCAATGCGACGAGAATGCAGAAGTCAGTAACATGTGCATGTTTGTTGTGCTCCTTTTTTC 

TGTTGGTAAAGTACAGAATTCAGCAAATAAAAAGGGCCACCCTGGCCAAAAGCGGTAAAAAA 
AAAAAAAAAA 
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MQTFTMVLEEIWTSLFMWFFYALIPCLLTDEVAILPAPQNLSVLSTNMKHLLMWSPVIAPG 

T\hnfSVEYQGEYESLYTSHIWIPSSWCSLTEGPECTVTDDITATVPYNIJ^ 

SILKHPFNRNSTILTRPGMEITKDGFHLVIELEDIXSPQFEFLVAYWRREPGAEEHVKM^ 

GIPVHLETMEPGAAYCVKAQTFVKAIGRYSAFSQTECVEVQGEAIPLVIJ^FAFVGFMLILV 

WPLFWKMGRLLQYSCCPVVVLPDTLKITNSPQKLISCRREEVDACATAVMSPEEL^ 

Is^ortant features : 
Signal peptide: 
amino acids 1-29 

Transmembrane domain: 

amino acids 230-255 

N-glycosylation sites, 

amino acids 40-43 and 134-137 

Tissue factor proteins homology. 

amino acids 92-119 

Integrins alpha chain protein homology. 

amino acids 232-262 
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FIGURE 118 

TCCTGCTGATGCACATCTGGGTTTGGCAAAAGGAGGTTGCTTCGAGCCGCCCTTTCTAGCTT 

CCTGGCCGGCTCTAGAACAATTCAGGCTTCGCTGCGACTAGACCTCAGCTCCAACATATGCA 

TTCTGAAGAAAGATGGCTGAGATGACAGAATGCTTTATTTTGGAAAGAAACAATGTTCTAGG 

TCAAACTGAGTCTACCAAATGCAGACTTTCACAATGGTTCTAGAAGAAATCTGGACAAGTCT 

TTTCATGTGGTTTTTCTACGCATTGATTCCATGTTTGCTCACAGATGAAGTGGCCATTCTGC 

CTGCCCCTCAGAACCTCTCTGTACTCTCAACCAACATGAAGCATCTCTTGATGTGGAGCCCA 

GTGATCGCGCCTGGAGAAACAGTGTACTATTCTGTCGAATACCAGGGGGAGTACGAGAGCCT 

GTACACGAGCCACATCTGGATCCCCAGCAGCTGGTGCTCACTCACTGAAGGTCCTGAGTGTG 

ATGTCACTGATGACATCACGGCCACTGTGCCATACAACCTTTGTGTCAGGGCCACATTGGGC 

TCACAGACCTCAGCCTGGAGCATCCTGAAGCATCCCTTTAATAGAAACTCAACCATCCTTAC 

CCGACCTGGGATGGAGATCACCAAAGATGGCTTNCACCTGGTTATTGAGCTGGAGGACCTGG 

GGCCCCAGTTTGAGTTCCTTGTGGCCTANTGGAGGAGGGGCGAACCCCTTGCGGCGCAAGGG 

GTTNGCGAACCCCTTGCGGCCGCTGGGGTATCTCTCGAGAAAAGAGAGGCCCAATATGACCCAC 

ATACTCAATATGGACGAANTGCTATTGTCCACCTGTTTGAGTGGCGCTGGGTTGAT 
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FIGURE 119 
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CGGACGCGTGGGCCGCCACCTCCGGAACAAGCCaTGGTGGCGGCGACGGTGGCAGCGGCGTG 

GCTGCTCCTGTGGGCTGCGGCCTGCGCGCAGCAGGAGCAGGACTTCTACGACTTCAAGGCGG 

TCAACATCCGGGGCAAACTGGTGTCGCTGGAGAAGTACCGCGGATCGGTGTCCCTGGTGGTG 

AATGTGGCCAGCGAGTGCGGCTTCACAGACCAGCACTACCGAGCCCTGCAGCAGCTGCAGCG 

AGACCTGGGCCCCCACCACTTTAACGTGCTCGCCTTCCCCTGCAACCAGTTTGGCCAACAGG 

AGCCTGACAGCAACAAGGAGATTGAGAGCTTTGCCCGCCGCACCTACAGTGTCTCATTCCCC 

ATGTTTAGCAAGATTGCAGTCACCGGTACTGGTGCCCATCCTGCCTTCAAGTACCTGGCCCA 

GACTTCTGGGAAGGAGCCCACCTGGAACTTCTGGAAGTACCTAGTAGCCCCAGATGGAAAGG 

TGGTAGGGGCTTGGGACCCAACTGTGTCAGTGGAGGAGGTCAGACCCCAGATCACAGCGCTC 

GTGAGGAAGCTCATCCTACTGAAGCGAGAAGACTTA^AACCACCGCGTCTCCTCCTCCACCA 

CCTCATCCCGCCCACCTGTGTGGGGCTGACCAATGCAAACTCAAATGGTGCTTCAAAGGGAG 

AGACCCACTGACTCTCCTTCCTTTACTCTTATGCCATTGGTCCCATCATTCTTGTGGGGGAA 

AAATTCTAGTATTTTGATTATTTGAATCTTACAGCAACAAATAGGAACTCCTGGCCAATGAG 

AGCTCTTGACCAGTGAATCACCAGCCGATACGAACGTCTTGCCAACAAAAATGTGTGGCAAA 

TAGAAGTATATCAAGCAATAATCTCCCACCCAAGGCTTCTGTAAACTGGGACCAATGATTAC 

CTCATAGGGCTGTTGTGAGGATTAGGATGAAATACCTGTGAAAGTGCCTAGGCAGTGCCAGC 

CAAATAGGAGGCATTCAATGAACATTTTTTGCATATAAACCAAAAAATAACTTGTTATCAAT 

AAAAACTTGCATCCAACATGAATTTCCAGCCGATGATAATCCAGGCCAAAGGTTTAGTTGTT 

GTTATTTCCTCTGTATTATTTTCTTCATTACAAAAGAAATGCAAGTTCATTGTAACAATCCA 

AACAATACCTCACGATATAAAATAAAAATGAAAGTATCCTCCTCAAAAA 
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FIGURE 120 



MVAATVAAAWLLLWAAACAQQEQDFYDFKAVNIRGKLVSLEKYRGSVSLWNVASECGFTDQ 
HYRALQQLQRDLGPHHFNVLAFPCNQFGQQEPDSNKEIESFARRTYSVSFPMFSKIAVTGTG 
i^PAFKYIAQTSGKEPTWNFWKYLVAPDGKVVGAWDPTVSVEEVRPQITALVRKLILLKREDL 
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FIGURE 121 
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CGGACGCGTGGGCGGGCCGGGACGCAGGGCAAAGCGAGCCATGGCTGTCTACGTCGGGATGC 

TGCGCCTGGGGAGGCTGTGCGCCGGGAGCTCGGGGGTGCTGGGGGCCCGGGCCGCCCTCTCT 

CGGAGTTGGCAGGAAGCCAGGTTGCAGGGTGTCCGCTTCCTCAGTTCCAGAGAGGTGGATCG 

CATGGTCTCCACGCCCATCGGAGGCCTCAGCTACGTTCAGGGGTGCACCAAAAAGCATCTTA 

ACAGCAAGACTGTGGGCCAGTGCCTGGAGACCACAGCACAGAGGGTCCCAGAACGAGAGGCC 

TTGGTCGTCCTCCATGAAGACGTCAGGTTGACCTTTGCCCAACTCAAGGAGGAGGTGGACAA 

AGCTGCTTCTGGCCTCCTGAGCATTGGCCTCTGCAAAGGTGACCGGCTGGGCATGTGGGGAC 

CTAACTCCTATGCATGGGTGCTCATGCAGTTGGCCACCGCCCAGGCGGGCATCATTCTGGTG 

TCTGTGAACCCAGCCTACCAGGCTATGGAACTGGAGTATGTCCTCAAGAAGGTGGGCTGCAA 

GGCCCTTGTGTTCCCCAAGCAATTCAAGACCCAGCAATACTACAACGTCCTGAAGCAGATCT 

GTCCAGAAGTGGAGAATGCCCAGCCAGGGGCCTTGAAGAGTCAGAGGCTCCCAGATCTGACC 

ACAGTCATCTCGGTGGATGCCCCTTTGCCGGGGACCCTGCTCCTGGATGAAGTGGTGGCGGC 

TGGCAGCACACGGCAGCATCTGGACCAGCTCCAATACAACCAGCAGTTCCTGTCCTGCCATG 

ACCCCATCAACATCCAGTTCACCTCGGGGACAACAGGCAGCCCCAAGGGGGCCACCCTCTCC 

CACTACAACATTGTCAACAACTCCAACATTTTAGGAGAGCGCCTGAAACTGCATGAGAAGAC 

ACCAGAGCAGTTGCGGATGATCCTGCCCAACCCCCTGTACCATTGCCTGGGTTCCGTGGCAG 

GCACAATGATGTGTCTGATGTACGGTGCCACCCTCATCCTGGCCTCTCCCATCTTCAATGGC 

AAGAAGGCACTGGAGGCCATCAGCAGAGAGAGAGGCACCTTCCTGTATGGTACCCCCACGAT 

GTTCGTGGACATTCTGAACCAGCCAGACTTCTCCAGTTATGACATCTCGACCATGTGTGGAG 

GTGTCATTGCTGGGTCCCCTGCACCTCCAGAGTTGATCCGAGCCATCATCAACAAGATAAAT 

ATGAAGGACCTGGTGGTTGCTTATGGAACCACAGAGAACAGTCCCGTGACATTCGCGCACTT 

CCCTGAGGACACTGTGGAGCAGAAGGCAGAAAGCGTGGGCAGAATTATGCCTCACACGGAGG 

CCCGGATCATGAACATGGAGGCAGGGACGCTGGCAAAGCTGAACACGCCCGGGGAGCTGTGC 

ATCCGAGGGTACTGCGTCATGCTGGGCTACTGGGGTGAGCCTCAGAAGACAGAGGAAGCAGT 

GGATCAGGACAAGTGGTATTGGACAGGAGATGTCGCCACAATGAATGAGCAGGGCTTCTGCA 

AGATCGTGGGCCGCTCTAAGGATATGATCATCCGGGGTGGTGAGAACATCTACCCCGCAGAG 

CTCGAGGACTTCTTTCACACACACCCGAAGGTGCAGGAAGTGCAGGTGGTGGGAGTGAAGGA 

CGATCGGATGGGGGAAGAGATTTGTGCCTGCATTCGGCTGAAGGACGGGGAGGAGACCACGG 

TGGAGGAGATAAAAGCTTTCTGCAAAGGGAAGATCTCTCACTTCAAGATTCCGAAGTACATC 

GTGTTTGTCACAAACTACCCCCTCACCATTTCAGGAAAGATCCAGAAATTCAAACTTCGAGA 

GCAGATGGAACGACATCTAAATCTGTGAATAAAGCAGCAGGCCTGTCCTGGCCGGTTGGCTT 

GACTCTCTCCTGTCAGAATGCAACCTGGCTTTATGCACCTAGATGTCCCCAGCACCCAGTTC 

TGAGCCAGGCACATCAAATGTCAAGGAATTGACTGAACGAACTAAGAGCTCCTGGATGGGTC 

CGGGAACTCGCCTGGGCACAAGGTGCCAAAAGGCAGGCAGCCTGCCCAGGCCCTCCCTCCTG 

TCCATCCCCCACATTCCCCTGTCTGTCCTTGTGATTTGGCATAAAGAGCTTCTGTTTTCTTT 

GAAAAAAAAAAAAAAAA 
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FIGURE 1 22 

MAVYVGMLRLGRLCAGSSGVLGARAALSRSWQEARLQGVRFLSSREVDRMVSTPIGGLSYVQ 
GCTKKHLNSKTVGQCLETTAQRVPEREALWLHEDVRLTFAQLKEEVDKAASGLLSIGLCKG 
DRLGMWGPNSYAWVLMQLATAQAGIILVSVNPAYQAMELEYVLKKVGCKALVFPKQFKTQQY 
YNVLKQICPEVENAQPGALKSQRLPDLTTVISVDAPLPGTLLLDEWAAGSTRQHLDQLQYN 
QQFLSCHDPINIQFTSGTTGSPKGATLSHYNIVNNSNILGERLKLHEKTPEQLRMILPNPLY 
HCLGSVAGTMMCLMYGATLI LASP IFNGKKALEA I SRERGTFLYGTPTMFVDI LNQPDFSS Y 
DISTMCGGVIAGSPAPPELIRAIINKINMKDLWAYGTTENSPVTFAHFPEDTVEQKAESVG 
RIMPHTEARIMNMEAGTLAKLNTPGELCIRGYCVMLGYWGEPQKTEEAVDQDKWYWTGDVAT 
MNEQGFCKIVGRSKDMIIRGGENIYPAELEDFFHTHPKVQEVQWGVKDDRMGEEICACIRL 
KDGEETTVEEIKAFCKGKISHFKIPKYIVFVTNYPLTISGKIQKFKLREQMERHLNL 
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FIGURE 121 

CAACTCCAACATTTTAGGAGAGCGCCTGAAACTGCATGAGAAGACACCAGAGCAGTTGCGGA 

TGATCCTGCCCAACCCCCTGTACCATTGCCTGGGTTCCGTGGCAGGCACAATGATGTGTCTG 

ATGTACGGTGCCACCCTCATCCTGGCCTCTCCCATCTTCAATGGCAAGAAGGCACTGGAGGC 

CATCAGCAGAGAGAGAGGCACCTTCCTGTATGGTACCCCCACGATGTTCGTGGACATTCTGA 

ACCAGCCAGACTTCTCCAGTTATGACATCTCGACCATGTGTGGAGGTGTCATTGCTGGGTCC 

CCTGCACCTCCAGAGTTGATCCGAGCCATCATCAACAAGATAAATATGAAGGACCTGGTGGT 

TGCTTATGGAACCACAGAGAACAGTCCCGTGACATTCGCGCACTTCCCTGAGGACACTGTGG 

AGCAGAAGGCAGAAAGCGTGGGCAGAATTATGCCTCACACGGAGGCGCGGATCATGAACATG 

GAGGCAGGGACGCTGGCAAAGCTGAACACGCCCGGGGAGCTGTGCATCCGAGGGTACTGCGt 

CATGCTGGGCTACTGGGGTGAGCCTCAGAAGACAGAGGAAGCAGTGGATCAGGACAAGTGGT 
ATTGGACAGGAGATGTCGCCAC 
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GAGCAGGACGGAGCCATGGACCCCGCCAGGAAAGCAGGTGCCCAGGCCATGATCTGGACTGC 

AGGCTGGCTGCTGCTGCTGCTGCTTCGCGGAGGAGCGCAGGCCCTGGAGTGCTACAGCTGCG 

TGCAGAAAGCAGATGACGGATGCTCCCCGAACAAGATGAAGACAGTGAAGTGCGCGCCGGGC 

GTGGACGTCTGCACCGAGGCCGTGGGGGCGGTGGAGACCATCCACGGACAATTCTCGCTGGC 

AGTGCGGGGTTGCGGTTCGGGACTCCCCGGCAAGAATGACCGCGGCCTGGATCTTCACGGGC 

TTCTGGCGTTCATCCAGCTGCAGCAATGCGCTCAGGATCGCTGCAACGCCAAGCTCAACCTC 

ACCTCGCGGGCGCTCGACCCGGCAGGTAATGAGAGTGCATACCCGCCCAACGGCGTGGAGTG 

CTACAGCTGTGTGGGCCTGAGCCGGGAGGCGTGCCAGGGTACATCGCCGCCGGTCGTGAGCT 

GCTACAACGCCAGCGATCATGTCTACAAGGGCTGCTTCGACGGCAACGTCACCTTGACGGCA 

GCTAATGTGACTGTGTCCTTGCCTGTCCGGGGCTGTGTCCAGGATGAATTCTGCACTCGGGA 

TGGAGTAACAGGCCCAGGGTTCACGCTCAGTGGCTCCTGTTGCCAGGGGTCCCGCTGTAACT 

CTGACCTCCGCAACAAGACCTACTTCTCCCCTCGAATCCCACCCCTTGTCCGGCTGCCCCCT 

CCAGAGCCCACGACTGTGGCCTCAACCACATCTGTCACCACTTCTACCTCGGCCCCAGTGAG 

ACCCACATCCACCACCAAACCCATGCCAGCGCCAACCAGTCAGACTCCGAGACAGGGAGTAG 

AACACGAGGCCTCCCGGGATGAGGAGCCCAGGTTGACTGGAGGCGCCGCTGGCCACCAGGAC 

CGCAGCAATTCAGGGCAGTATCCTGCAAAAGGGGGGCCCCAGCAGCCCCATAATAAAGGCTG 

TGTGGCTCCCACAGCTGGATTGGCAGCCCTTCTGTTGGCCGTGGCTGCTGGTGTCCTACTGT 

fiAGCTTCTCCACCTGGAAATTTCCCTCTCACCTACTTCTCTGGCCCTGGGTACCCCTCTTCT 

CATCACTTCCTGTTCCCACCACTGGACTGGGCTGGCCCAGCCCCTGTTTTTCCAACATTCCC 

CAGTATCCCCAGCTTCTGCTGCGCTGGTTTGCGGCTTTGGGAAATAAAATACCGTTGTATAT 

ATTCTGCCAGGGGTGTTCTAGCTTTTTGAGGACAGCTCCTGTATCCTTCTCATCCTTGTCTC 

TCCGCTTGTCCTCTTGTGATGTTAGGACAGAGTGAGAGAAGTCAGCTGTCACGGGGAAGGTG 

AGAGAGAGGATGCTAAGCTTCCTACTCACTTTCTCCTAGCCAGCCTGGACTTTGGAGCGTGG 

GGTGGGTGGGACAATGGCTCCCCACTCTAAGCACTGCCTCCCCTACTCCCCGCATCTTTGGG 

GAATCGGTTCCCCATATGTCTTCCTTACTAGACTGTGAGCTCCTCGAGGGGGGGCCCGGTAC 
CCAATTCGCCCTATAGTGAGTCGTA 
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MDPARKAGAQAMIWTAGWLIJ^GGAQALECYSCVQKADDGCSPNKMKTVKCAPC^^ 

EAVGAVETIHGQFSIAVRGCGSGLPGKNDRGLDLHGI^IQI^QCAQDRCNAKLNLTSRAL 

DPAGNESAYPPNGVECYSCVGLSREACQGTSPPWSCYNASDHVYKGCPDGNVTLTAANVTV 

SLPVRGCVQDEFCTRDGVTGPGFTLSGSCCQGSRCNSDLRNKTYFSPRIPPLVRLPPPEPTT 

VASTTSVTTSTSAPVRPTSTTKPMPAPTSQTPRQGVEHEASRDEEPRLTGGAAGHQDRSNSG 
QYPAKGGPQQPHNKGCVAPTAGLAALLLAVAAGVLL 
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FIGURE 126 

CGGGACTCGGCGGGTCCTCCTGGGAGTCTCGGAGGGGACCGGCTGTGCAGACGCCATGGAGT 

TGGTGCTGGTCTTCCTCTGCAGCCTGCTGGCCCCCATGGTCCTGGCCAGTGCAGCTGAAAAG 

GAGAAGGAAATGGACCCTTTTCATTATGATTACCAGACCCTGAGGATTGGGGGACTGGTGTT 

CGCTGTGGTCCTCTTCTCGGTTGGGATCCTCCTTATCCTAAGTCGCAGGTGCAAGTGCAGTT 

TCAATCAGAAGCCCCGGGCCCCAGGAGATGAGGAAGCCCAGGTGGAGAACCTCATCACCGCC 

AATGCAACAGAGCCCCAGAAGCAGAGAACTGAAGTGCAGCCATCAGGTGGAAGCCTCTGGAA 

CCTGAGGCGGCTGCTTGAACCTTTGGATGCAAATGTCGATGCTTAAGAAAACCGGCCACTTC 

AGCAACAGCCCTTTCCCCAGGAGAAGCCAAGAACTTGTGTGTCCCCCACCCTATCCCCTCTA 

ACACCATTCCTCCACCTGATGATGCAACTAACACTTGCCTCCCCACTGCAGCCTGCGGTCCT 

GCCCACCTCCCGTGATGTGTGTGTGTGTGTGTGTGTGTGACTGTGTGTGTTTGCTAACTGTG 

GTCTTTGTGGCTACTTGTTTGTGGATGGTATTGTGTTTGTTAGTGAACTGTGGACTCGCTTT 

CCCAGGCAGGGGCTGAGCCACATGGCCATCTGCTCCTCCCTGCCCCCGTGGCCCTCCATCAC 

CTTCTGCTCCTAGGAGGCTGCTTGTTGCCCGAGACCAGCCCCCTCCCCTGATTTAGGGATGC 

GTAGGGTAAGAGCACGGGCAGTGGTCTTCAGTCGTCTTGGGACCTGGGAAGGTTTGCAGCAC 

TTTGTCATCATTCTTCATGGACTCCTTTCACTCCTTTAACAAAAACCTTGCTTCCTTATCCC 

ACCTGATCCCAGTCTGAAGGTCTCTTAGCAACTGGAGATACAAAGCAAGGAGCTGGTGAGCC 

CAGCGTTGACGTCAGGCAGGCTATGCCCTTCCGTGGTTAATTTCTTCCCAGGGGCTTCCACG 

AGGAGTCCCCATCTGCCCCGCCCCTTCACAGAGCGCCCGGGGATTCCAGGCCCAGGGCTTCT 

ACTCTGCCCCTGGGGAATGTGTCCCCTGCATATCTTCTCAGCAATAACTCCATGGGCTCTGG 

GACCCTACCCCTTCCAACCTTCCCTGCTTCTGAGACTTCAATCTACAGCCCAGCTCATCCAG 

ATGCAGACTACAGTCCCTGCAATTGGGTCTCTGGCAGGCAATAGTTGAAGGACTCCTGTTCC 

GTTGGGGCCAGCACACCGGGATGGATGGAGGGAGAGCAGAGGCCTTTGCTTCTCTGCCTACG 

TCCCCTTAGATGGGCAGCAGAGGCAACTCCCGCATCCTTTGCTCTGCCTGTCGGTGGTCAGA 

GCGGTGAGCGAGGTGGGTTGGAGACTCAGCAGGCTCCGTGCAGCCCTTGGGAACAGTGAGAG 

GTTGAAGGTCATAACGAGAGTGGGAACTCAACCCAGATCCCGCCCCTCCTGTCCTCTGTGTT 

CCCGCGGAAACCAACCAAACCGTGCGCTGTGACCCATTGCTGTTCTCTGTATCGTGATCTAT 

CCTCAACAACAACAGAAAAAAGGAATAAAATATCCTTTGTTTCCT 
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FIGURE 127 

MELVLVFLCSLLAPMVLASAAEKEKEMDPFHYDYQTLRIGGLVPAWLFSVGILLILSRRCK 
CSFNQKPRAPGDEEAQVENLITANATEPQKQRTEVQPSGGSLWNLRRLLEPLDANVDA 
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FIGURE 128 

AAACTTGACGCCATGAAGATCCCGGTCCTTCCTGCCGTGGTGCTCCTCTCCCTCCT 

CCACTCTGCCCAGGGAGCCACCCTGGGTGGTCCTGAGGAAGAAAGCACCATTGAGAATTATG 

CGTCACGACCCGAGGCCITTAACACCCCGTTCCTGAACATCGACAAATTGCGATCTGCGTTT 

AAGGCTGATGAGTTCCTGAACTGGCACGCCCTCTTTGAGTCTATCAAAAGGAAACTTCCTTT 

CCTCAACTGGGATGCCTTTCCTAAGCTGAAAGGACTGAGGAGCGCAACTCCTGATGCCCAGT 

GACCATGACCTCCACTGGAAGAGGGGGCTAGCGTGAGCGCTGATTCTCAACCTACCATAACT 
CTTTCCTGCCTCAGGAACTCCAATAAAACATTTTCCATCCAAA 
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FIGURE 129 

MKIPVLPAVVLLSLLVLHSAQGATLGGPEEESTIENYASRPEAFNTPFIjgiDKLRSAFKADE 
FLNWHALFES IKRKLPFLNWDAFPKLKGLRSATPDAQ 
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FIGURE 130 

CAGTTCTGAAATCAATGGAGTTAATTTAGGGAATACAAACCAGCCMSGGGGTGGAGATTC 

CTTTGCCTCAGTGATTCTOiCCreCCTCTCCCTTCTGGCAGCAGGAGTCTCCCAGGTTGTTC 

TTCTCCAGCCAGTTCCAACTCAGGAGACAGGTCCCAAGGCCATGGGAGATCTCTCCTGTGGC 

TTTGCCGGCCACTCATGAGAGTGTTTTTGTGTAAAGTATTTTTTAGAATACTGTTGACTTCT 

TCATGATTTAATAACCATCCTTTGCGAAGTTTTATGAGGCTTTAGGGGAATGTCAACCCTCA 

AATTTTTGTTATACTAGATGGCTTCCATTTACCCACCACTATTTTAAGGTCCCTTTATTTTT 

AGGTTCAAGGTTCATTTGACTTGAGAAAGTGCCCTTCTGCAGCTTCATTGATTTTGTTTATC 

TTCACTATTAATTGTAACGATTAAAAAAGAATAAGAGCACGCAGACCTCTAGGAGAATATTT 

TATCCCTGGGTGCCCCTGACACATTTATGTAGTGATCCCACAAATGTGATTGTTAATTTAAA 

TGTTATTCTAATATTAGTACATTCAGTTGTGATGTAATATGAATAACCAGAATCTATTTCTT 

AAAAGTTTTGAGTATATTTTTCAACTAGATATTTGTATAGAAAGACTGAATAGTGATG 
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FIGURK 

MGVEIAFASVILTCLSLLAAGVSQWLLQPVPTQETGPKAMGDLSCGFAGHS 
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FIGURE 132 



GGGGAATCTGCAGTAGGTCTGCCGGCGATGGAGTGGTGGGCTAGCTCGCCGCTTCGGCTCTG 

GCTGCTGTTGTTCCTCCTGCCCTCAGCGCAGGGCCGCCAGAAGGAGTCAGGTTCAAAATGGA 

AAGTATTTATTGACCAAATTAACAGGTCTTTGGAGAATTACGAACCATGTTCAAGTCAAAAC 

TGCAGCTGCTACCATGGTGTCATAGAAGAGGATCTAACTCCTTTCCGAGGAGGCATCTCCAG 

GAAGATGATGGCAGAGGTAGTCAGACGGAAGCTAGGGACCCACTATCAGATCACTAAGAACA 

GACTGTACCGGGAAAATGACTGCATGTTCCCCTCAAGGTGTAGTGGTGTTGAGCACTTTATT 

TTGGAAGTGATCGGGCGTCTCCCTGACATGGAGATGGTGATCAATGTACGAGATTATCCTCA 

GGTTCCTAAATGGATGGAGCCTGCCATCCCAGTCTTCTCCTTCAGTAAGACATCAGAGTACC 

ATGATATCATGTATCCTGCTTGGACATTTTGGGAAGGGGGACCTGCTGTTTGGCCAATTTAT 

CCTACAGGTCTTGGACGGTGGGACCTCTTCAGAGAAGATCTGGTAAGGTCAGCAGCACAGTG 

GCCATGGAAAAAGAAAAACTCTACAGCATATTTCCGAGGATCAAGGACAAGTCCAGAACGAG 

ATCCTCTCATTCTTCTGTCTCGGAAAAACCCAAAACTTGTTGATGCAGAATACACCAAAAAC 

CAGGCCTGGAAATCTATGAAAGATACCTTAGGAAAGCCAGCTGCTAAGGATGTCCATCTTGT 

GGATCACTGCAAATACAAGTATCTGTTTAATTTTCGAGGCGTAGCTGCAAGTTTCCGGTTTA 

AACACCTCTTCCTGTGTGGCTCACTTGTTTTCCATGTTGGTGATGAGTGGCTAGAATTCTTC 

TATCCACAGCTGAAGCCATGGGTTCACTATATCCCAGTCAAAACAGATCTCTCCAATGTCCA 

AGAGCTGTTACAATTTGTAAAAGCAAATGATGATGTAGCTCAAGAGATTGCTGAAAGGGGAA 

GCCAGTTTATTAGGAACCATTTGCAGATGGATGACATCACCTGTTACTGGGAGAACCTCTTG 

AGTGAATACTCTAAATTCCTGTCTTATAATGTAACGAGAAGGAAAGGTTATGATCAAATTAT 

TCCCAAAATGTTGAAAACTGAACTATAGTAGTCATCATAGGACCATAGTCCTCTTTGTGGCA 

ACAGATCTCAGATATCCTACGGTGAGAAGCTTACCATAAGCTTGGCTCCTATACCTTGAATA 

TCTGCTATCAAGCCAAATACCTGGTTTTCCTTATCATGCTGCACCCAGAGCAACTCTTGAGA 

AAGATTTAAAATGTGTCTAATACACTGATATGAAGCAGTTCAACTTTTTGGATGAATAAGGA 

CCAGAAATCGTGAGATGTGGATTTTGAACCCAACTCTACCTTTCATTTTCTTAAGACCAATC 

ACAGCTTGTGCCTCAGATCATCCACCTGTGTGAGTCCATCACTGTGAAATTGACTGTGTCCA 

TGTGATGATGCCCTTTGTCCCATTATTTGGAGCAGAAAATTCGTCATTTGGAAGTAGTACAA 

CTCATTGCTGGAATTGTGAAATTATTCAAGGCGTGATCTCTGTCACTTTATTTTAATGTAGG 

AAACCCTATGGGGTTTATGAAAAATACTTGGGGATCATTCTCTGAATGGTCTAAGGAAGCGG 

TAGCCATGCCATGCAATGATGTAGGAGTTCTCTTTTGTAAAACCATAAACTCTGTTACTCAG 

GAGGTTTCTATAATGCCACATAGAAAGAGGCCAATTGCATGAGTAATTATTGCAATTGGATT 

TCAGGTTCCCTTTTTGTGCCTTCATGCCCTACTTCTTAATGCCTCTCTAAAGCCAAA 
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FIGURE 

MEWWASSPLRLWLLLFLLPSAQGRQKESGSKWKVFIDQINRSLENYEPCSSQNCSCYHGVIE 

EDLTPFRGGISRKMMAEVVRRKLGTHYQITKNRLYRENDCMFPSRCSGVEHFILEVIGRLPD 

MEMVINVRDYPQVPKWMEPAIPVFSFSKTSEYHDIMYPAWTFWEGGPAVWPIYPTGLGRWDL 

FREDLVRSAAQWPWKKKNSTAYFRGSRTSPERDPLILLSRKNPKLVDAEYTKNQAWKSMKDT 

LGKPAAKDVHLVDHCKYKYLFNFRGVAASFRFKHLFLCGSLVFHVGDEWLEFFYPQLKPWVH 

YIPVKTDLSNVQELLQFVKANDDVAQEIAERGSQFIRNHLQMDDITCYWENLLSEYSKFLSY 
NVTRRKGYDQI IPKMLKTEL 
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FIGURE 1 ^4 

aVCCCCTCCATTTCTCGCCATGGCCCCTGCACTGCTCCTGATCCCTGCTGCCCTCGCCTCTT 

TCATCCTGGCCTTTGGCACCGGAGTGGAGTTCGTGCGCTTTACCTCCCTTCGGCCACTTCTT 

GGAGGGATCCCGGAGTCTGGTGGTCCGGATGCCCGCCAGGGATGGCTGGCTGCCCTGCAGGA 

CCGCAGCATCCTTGCCCCCCTGGCATGGGATCTGGGGCTCCTGCTTCTATTTGTTGGGCAGC 

ACAGCCTCATGGCAGCTGAAAGAGTGAAGGCATGGACATCCCGGTACTTTGGGGTCCTTCAG 

AGGTCACTGTATGTGGCCTGCACTGCCCTGGCCTTGCAGCTGGTGATGCGGTACTGGGAGCC 

CATACCCAAAGGCCCTGTGTTGTGGGAGGCTCGGGCTGAGCCATGGGCCACCTGGGTGCCGC 

TCCTCTGCTTTGTGCTCCATGTCATCTCCTGGCTCCTCATCTTTAGCATCCTTCTCGTCTTT 

GACTATGCTGAGCTCATGGGCCTCT^CAGGTATACTACCATGTGCTGGGGCTGGGCGAGCC 

TCTGGCCCTGAAGTCTCCCCGGGCTCTCAGACTCTTCTCCCACCTGCGCCACCCAGTGTGTG 

TGGAGCTGCTGACAGTGCTGTGGGTGGTGCCTACCCTGGGCACGGACCGTCTCCTCCTTGCT 

TTCCTCCTTACCCTCTACCTGGGCCTGGCTCACGGGCTTGATCAGCAAGACCTCCGCTACCT 

CCGGGCCCAGCTACAAAGAAAACTCCACCTGCTCTCTCGGCCCCAGGATGGGGAGGCAGAGT 

6AGGAGCTCACTCTGGTTACAAGCCCTGTTCTTCCTCTCCCACTGAATTCTAAATCCTTAAC 

ATCCAGGCCCTGGCTGCTTCATGCCAGAGGCCCAAATCCATGGACTGAAGGAGATGCCCCTT 

CTACTACTTGAGACTTTATTCTCTGGGTCCAGCTCCATACCCTAAATTCTGAGTTTCAGCCA 

CTGAACTCCAAGGTCCACTTCTCACCAGCAAGGAAGAGTGGGGTATGGAAGTCATCTGTCCC 

TTCACTGTTTAGAGCATGACACTCTCCCCCTCAACAGCCTCCTGAGAAGGAAAGGATCTGCC 

CTGACCACTCCCCTGGCACTGTTACTTGCCTCTGCGCCTCAGGGGTCCCCTTCTGCACCGCT 

GGCTTCCACTCCAAGAAGGTGGACCAGGGTCTGCAAGTTCAACGGTCATAGCTGTCCCTCCA 

GGCCCCAACCTTGCCTCACCACTCCCGGCCCTAGTCTCTGCACCTCCTTAGGCCCTGCCTCT 

GGGCTCAGACCCCAACCTAGTCAAGGGGATTCTCCTGCTCTTAACTCGATGACTTGGGGCTC 

CCTGCTCTCCCGAGGAAGATGCTCTGCAGGAAAATAAAAGTCAGCCTTTTTCTAAAAAAAA 
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I^PALLLIPAMASFILAFGTGVEFVRFTSLRPLLGGIPESGGPDARQGWLAALQDRSILAP 

lAWDKSLLLLFVGQHSimAERVKAWTSRYFGVLQRSLYVACTALALQLVMRYWEPIPKGPV 

LWEARAEPWAIWPLLCFVLHVISWLLIFSILLVFDyAELMGI^QVYYHVK3IX3EPL^^ 
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KLHLLSRPQDGEAE 
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FIGURE 136 

CCGAGCACAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGA 

AGAAATTGCCAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGT 

TTTAATTAAGCATGGAATACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATT 

CCACAGTTTTCTTAGCTCCCTGGACCCGGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTA 

TCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTAAAGAACCTTCGGCTCGCGTGCTTCTG 

AGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTAGGATGTCACTGAGATCC 

CTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTACCTCAGCCT 

TCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTT 

ACAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATCAAAATCCATTT 

CTGGTCATTCTGGT6ACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTAC 

TTGGGGTGAAAAAAAGTCTTGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAG 

AGGCTGAAAAGGAAGACAAAATGTTGGCATTGTCCTTAGAGGATGAACACCTTCTTTATGGT 

GACATAATCCGACAAGATTTTTTAGACACATATAATAACCTGACCTTGAAAACCATTATGGC 

ATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATGAAGACAGACACTGATG 

TTTTCATCAATACTGGCAATTTAGTGAAGTATCTTTTAAACCTAAACCACTCAGAGAAGTTT 

TTCACAGGTTATCCTCTAATTGATAATTATTCCTATAGAGGATTTTACCAAAAAACCCATAT 

TTCTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAA 

TGTCCAGAGATTTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTT 

GAAGATGTTTATGTCGGGATCTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGA 

CACAAATCTTTTCTTTCTATATAGAATCCATTTGGATGTCTGTCAACTGAGACGTGTGATTG 

CAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTTTGGCAGGTCATGCTAAGGAACACC 

ACATGCCATTATTAACTTCACATTCTACAAAAAGCCTAGAAGGACAGGATACCTTGTGGAAA 

GTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTTACACTG 

AACTGAAACTCATGAAAAACCCAGACTGGAGACTGGAGGGTTACACTTGTGATTTATTAGTC 

AGGCCCTTCAAAGATGATATGTGGAGGAATTAAATATAAAGGAATTGGAGGTTTTTGCTAAA 

GAAATTAATAGGACCAAACAATTTGGACATGTCATTCTGTAGACTAGAATTTCTTAAAAGGG 

TGTTACTGAGTTATAAGCTCACTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATTTATTG 

AACAATGTAGTCACTTGAAGGTTTTGTGTATATCTTATGTGGATTACCAATTTAAAAATATA 

TGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATACTGAACAAAATTTTACCTGTTTT 

TGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATTATTATTTAAAATTA 

CTTCAACTTTGTGTTTTTAAATGTTTTGACGATTTCAATACAAGATAAAAAGGATAGTGAAT 

CATTCTTTACATGCAAACATTTTCCAGTTACTTAACTGATCAGTTTATTATTGATACATCAC 

TCCATTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAAT 

ATTTTACTGTGGTAATATAGAGAAGAATTAAAGCAAGAAAATCTGAAAA 
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FIGURE 117 

MASALVnVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHYNVIERVNWMYFYEYEPIYRQD 

FHFTLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEK 

EDmiALSLEDEHLLYGDIIRQDFU)TYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFIN 

TGNLVKYLLNLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRD 

LVPRIYEMMGHVKPIKFEDVYVGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHG 
FSSKEI ITFWQVMLRNTTCHY 
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FIGURE 138 
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CCTCTGTCCACTGCTTTCGTGAAGACAAGATGAAGTTCACAATTGTCTTTGCTGGACTTCTT 
GGAGTCTTTCTAGCTCCTGCCCTAGCTAACTATAATATCAACGTCAATGATGACAACAACAA 
TGCTGGAAGTGGGCAGCAGTCAGTGAGTGTCAACAATGAACACAATGTGGCCAATGTTGACA 
ATAACAACGGATGGGACTCCTGGAATTCCATCTGGGATTATGGAAATGGCTTTGCTGCAACC 
AGACTCTTTCAAAAGAAGACATGCATTGTGCACAAAATGAACAAGGAAGTCATGCCCTCCAT 
TCAATCCCTTGATGCACTGGTCAAGGAAAAGAAGCTTCAGGGTAAGGGACCAGGAGGACCAC 
CTCCCAAGGGCCTGATGTACTCAGTCAACCCAAACAAAGTCGATGACCTGAGCAAGTTCGGA 
AAAAACATTGCAAACATGTGTCGTGGGATTCCAACATACATGGCTGAGGAGATGCAAGAGGC 
AAGCCTGTTTTTTTACTCAGGAACGTGCTACACGACCAGTGTACTATGGATTGTGGACATTT 
CCTTCTGTGGAGACACGGTGGAGAACTA^CAATTTTTTAAAGCCACTATGGATTTAGTCAT 
CTGAATATGCTGTGCAGAAAAAATATGGGCTCCAGTGGTTTTTACCATGTCATTCTGAAATT 
TTTCTCTACTAGTTATGTTTGATTTCTTTAAGTTTCAATAAAATCATTTAGCATTGAAAAAAA 
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FIGURE 119 

MKFTIVFAGLLGVFIAPALANYNINVNDDNIWAGSGQQSVSVNNEHNVANVDNNNGWDSWNS 
IWDYGNGFAATRLFQKKTCIVHKMNKEVMPSIQSLDALVKEKKLQGKGPGGPPPKGLMYSVN 
PNKVDDLSKFGKNIANMCRGIPTYMAEEMQEASLFFYSGTCYTTSVLWIVDISFCGDTVEN 
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FIGURE 140 

CATTTCTGAAACTAATCGTGTCAGAATTGACTTTGAAAAGCATTGCTTTTTACAGAAGTATA 

TTAACTTTTTAGGAGTAATTTCTAGTTTGGATTGTAATATGAAATAATTTAAAAGGGCTTCG 

CTCATATATAGGAAAATCGCATATGGTCCTAGTATTAAATTCTTATTGCTTACTGATTTTTT 

TGAGTTAAGAGTTGTTATATGCTAGAATATGAGGATGTGAATATAAATAAGAGAAGAAAAAA 

GAATAAAGTAGATTGAGTCTCCAATTTTATGTAAGCTTCAGAAGAACTGGTTTGTTTACATG 

CAAGCTTATAGTTGAAATATTTTTCAGGAATTACATGAATGACAGTCTTCGAACCAATGTGT 

TTGTTCGATTTCAACCAGAGACTATAGCATGTGCTTGCATCTACCTTGCAGCTAGAGCACTT 

CAGATTCCGTTGCCAACTCGTCCCCATTGGTTTCTTCTTTTTGGTACTACAGAAGAGGAAAT 

CCAGGAAATCTGCATAGAAACACTTAGGCTTTATACCAGAAAAAAGCCAAACTATGAATTAC 

TGGAAAAAGAAGTAGAAAAAAGAAAAGTAGCCTTACAAGAAGCCAAATTAAAAGCAAAGGGA 

TTGAATCCGGATGGAACTCCAGCCCTTTCAACCCTGGGTGGATTTTCTCCAGCCTCCAAGCC 

ATCATCACCAAGAGAAGTAAAAGCTGAAGAGAAATCACCAATCTCCATTAATGTGAAGACAG 

TCAAAAAAGAACCTGAGGATAGACAACAGGCTTCCAAAAGCCCTTACAATGGTGTAAGAAAA 

GACAGCAAGAGAAGTAGAAATAGCAGAAGTGCAAGTCGATCGAGGTCAAGAACACGATCACG 

TTCTAGATCACATACTCCAAGAAGACACTATAATAATAGGCGGAGTCGATCTGGAACATACA 

GCTCGAGATCAAGAAGCAGGTCCCGCAGTCACAGTGAAAGCCCTCGAAGACATCATAATCAT 

GGTTCTCCTCACCTTAAGGCCAAGCATACCAGAGATGATTTAAAAAGTTCAAACAGACATGG 

TCATAAAAGGAAAAAATCTCGTTCTCGATCTCAGAGCAAGTCTCGGGATCACTCAGATGCAG 

CCAAGAAACACAGGCATGAAAGGGGACATCATAGGGACAGGCGTGAACGATCTCGCTCCTTT 

GAGAGGTCCCATAAAAGCAAGCACCATGGTGGCAGTCGCTCAGGACATGGCAGGCACAGGCG 

CIS&CTTTCTCTTCCTTTGAGCCTGCATCAGTTCTTGGTTTTGCCTATCTACAGTGTGATGT 

ATGGACTCAATCAAAAACATTAAACGCAAACTGATTAGGATTTGATTTCTTGAAACCCTCTA 

GGTCTCTAGAACACTGAGGACAGTTTCTTTTGAAAAGAACTATGTTAATTTTTTTGCACATT 

AAAATGCCCTAGCA6TATCTAATTAAAAACCATGGTCAGGTTCAATTGTACTTTATTATAGT 

TGTGTATTGTTTATTGCTATAAGAACTGGAGCGTGAATTCTGTAAAAATGTATCTTATTTTT 

ATACAGATAAAATTGCAGACACTGTTCTATTTAAGTGGTTATTTGTTTAAATGATGGTGAAT 

ACTTTCTTAACACTGGTTTGTCTGCATGTGTAAAGATTTTTACAAGGAAATAAAATACAAAT 
CTTGTTTTTTCTAAAAAAAAAAAAAAAAAAGT 
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FIGURE 141 

MNDSUlTNVFVRFQPETIACACIYIJU^QIPLPTRPHWFLLFGTTEEEIQEICIETLRLy 
TRKKPNYEIiEKEVEKRKVAUJEAKLKAKGLNPDGTPALSTLGGFSPASKPSSPREVKAEEK 
SPISINVKTVKKEPEDRQQASKSPYNGVRKDSKRSRNSRSASRSRSRTRSRSRSHTPRRHYN 
NRRSRSGTYSSRSRSRSRSHSESPRRHHNHGSPHLKAKHTRDDLKSSNRHGHKRKKSRSRSQ 
SKSRDHSDAAKKHRHERGHHRDRRERSRSFERSHKSKHHGGSRSGHGRHRR 
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FIGURE 142 
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TGGGGATAAAGGAAAAATGGTCAGGTATTAATGGCTTAAAGATTATTGGAAGGGGTTTATCA 

TTTTTTGAANNTATTCGGGTCANAATTGNCTTTGAAAAGCATTGCTTTTTACAGAAATATAT 

TANCTTTTTAGAGTAATTTCTAGTTTGGATTGTAATATGAAATTATTTAAAAGGGCTTCGCT 

CATATATAGGAAAATCGCATATGGTCCTAGTATTAAATTNTTATTGCTTACTGATTTTTTTG 

AGTTAAGAGTTGTTATATGNTAGAATATGAGGATGTGAATATAAATAAGAGAAGAAAAAAGA 

ATAAAGTAGATTGAGTCTCCAATTTTATGTAAGCTTCAGAAGAACTGGTTTGTTTACATGCA 

AGCTTATAGTTGAAATATTTTTCAGGAATTACATGAATGACAGTCTTCGAACCAATGTGTTT 

GTTCGATTTCAACCAGAGANTATAGCATGTGCTTGCATCTACCTTGCAGNTAGAGCACTTCA 

GATTCCGTTGCCAACTNGTCCCCATTGGTTTCTTCTTTTTGGTACTACAGAAGAGGAAATCC 

AGGAAATNTGCATAGAAACACTTAGGCTTTATACCAGAAAAAAGCCAAACTATGAATTACTG 

GAAAAAGAAGTAGAAAAAAGAAAAGTAGCCTTACAAGAAGCCNAATTAAAAGCAAAGGGATT 

GAATCCGGATGGAACTCCAGCCCTTTCAACCCTGGGTGGATTTTCTCC 
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FIGURE 14^ 
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GGCACGAGGCCTCGTGCCAAGCTTGGCACGAGGGTGCACCGCGTTCTCGCACGCGTCATGGC 

GGTCCTCGGAGTACAGCTGGTGGTGACCCTGCTCACTGCCACCCTCATGCACAGGCTGGCGC 

CACACTGCTCCTTCGCGCGCTGGCTGCTCTGTAACGGCAGTTTGTTCCGATACAAGCACCCG 

TCTGAGGAGGAGCTTCGGGCCCTGGCGGGGAAGCCGAGGCCCAGAGGCAGGAAAGAGCGGTG 

GGCCAATGGCCTTAGTGAGGAGAAGCCACTGTCTGTGCCCCGAGATGCCCCGTTCCAGCTGG 

AGACCTGCCCCCTCACGACCGTGGATGCCCTGGTCCTGCGCTTCTTCCTGGAGTACCAGTGG 

TTTGTGGACTTTGCTGTGTACTCGGGCGGCGTGTACCTCTTCACAGAGGCCTACTACTACAT 

GCTGGGACCAGCCAAGGAGACTAACATTGCTGTGTTCTGGTGCCTGCTCACGGTGACCTTCT 

CCATCAAGATGTTCCTGACAGTGACACGGCTGTACTTCAGCGCCGAGGAGGGGGGTGAGCGC 

TCTGTCTGCCTCACCTTTGCCTTCCTCTTCCTGCTGCTGGCCATGCTGGTGCAAGTGGTGCG 

GGAGGAGACCCTCGAGCTGGGCCTGGAGCCTGGTCTGGCCAGCATGACCCAGAACTTAGAGC 

CACTTCTGAAGAAGCAGGGCTGGGACTGGGCGCTTCCTGTGGCCAAGCTGGCTATCCGCGTG 

GGACTGGCAGTGGTGGGCTCTGTGCTGGGTGCCTTCCTCACCTTCCCAGGCCTGCGGCTGGC 

CCAGACCCACCGGGACGCACTGACCATGTCGGAGGACAGACCCATGCTGCAGTTCCTCCTGC 

ACACCAGCTTCCTGTCTCCCCTGTTCATCCTGTGGCTCTGGACAAAGCCCATTGCACGGGAC 

TTCCTGCACCAGCCGCCGTTTGGGGAGACGCGTTTCTCCCTGCTGTCCGATTCTGCCTTCGA 

CTCTGGGCGCCTCTGGTTGCTGGTGGTGCTGTGCCTGCTGCGGCTGGCGGTGACCCGGCCCC 

ACCTGCAGGCCTACCTGTGCCTGGCCAAGGCCCGGGTGGAGCAGCTGCGAAGGGAGGCTGGC 

CGCATCGAAGCCCGTGAAATCCAGCAGAGGGTGGTCCGAGTCTACTGCTATGTGACCGTGGT 

GAGCTTGCAGTACCTGACGCCGCTCATCCTCACCCTCAACTGCACAGTTCTGCTCAAGACGC 

TGGGAGGCTATTCCTGGGGCCTGGGCCCAGCTCCTCTACTATCCCCCGACCCATCCTCAGCC 

AGCGCTGCCCCCATCGGCTCTGGGGAGGACGAAGTCCAGCAGACTGCAGCGCGGATTGCCGG 

GGCCCTGGGTGGCCTGCTTACTCCCCTCTTCCTCCGTGGCGTCCTGGCCTACCTCATCTGGT 

GGACGGCTGCCTGCCAGCTGCTCGCCAGCCTTTTCGGCCTCTACTTCCACCAGCACTTGGCA 

GGCTCCTAGCTGCCTGCAGACCCTCCTGGGGCCCTGAGGTCTGTTCCTGGGGCAGCGGGACA 

CTAGCCTGCCCCCTCTGTTTGCGCCCCCGTGTCCCCAGCTGCAAGGTGGGGCCGGACTCCCC 

GGCGTTCCCTTCACCACAGTGCCTGACCCGCGGCCCCCCTTGGACGCCGAGTTTCTGCCTCA 

GAACTGTCTCTCCTGGGCCCAGCAGCATGAGGGTCCCGAGGCCATTGTCTCCGAAGCGTATG 

TGCCAGGTTTGAGTGGCGAGGGTGATGCTGGCTGCTCTTCTGAACAAATAAAGGAGCATGCC 
GATTTTTAA 
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FIGURE 144 



^^VLGVQLVVTLLTATLMHRLAPHCSFARWLLCNGSLFRYKHPSEEELRALAGKPRPRGRKE 

RWANGLSEEKPLSVPRDAPFQLETCPLTTVDALVLRFFLEYQWFVDFAVYSGGVYLFTEAYY 

YMLGPAKETNIAVFWCLLTVTFSIKMFLTVTRLYFSAEEGGERSVCLTFAFLFLLL^ 

VREETLELGLEPGIJ^MTQNLEPLLKKQGWDWALPVAKLAIRVGLAWGSVLGAFLTFPGIJ^ 

LAQTHRDALTMSEDRPMLQFLLHTSFLSPLFILWLWTKPIARDFLHQPPFGETRFSLLSDSA 

FDSGRLWLLWLCLLRLiAVTRPHLQAYLCLAKARVEQLRREAGRIEAREIQQRWRVYCYVT 

WSLQYLTPLILTLNCTLLLKTLGGYSWGLGPAPLLSPDPSSASAAPIGSGEDEVQQTAARI 

AGALGGLLTPLFLRGVLAYLIWWTAACQLLASLFGLYFHQHLAGS 
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FIGURE 145 
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CGTTNGCACGCGTCAATGGCGGTCCTCGGAGTACAGCTGGTGGTGACCCTGCTCACTGCCAC 

CCTCATGCACAGGCTGGCGCCACACTGCTCCTTCGCGCGCTGGCTGCTCTGTAACGGCAGTT 

TGTTCCGATACAAGCACCCGTNTTGAGGAGGAGCTTCGGGCCCTGGCGGGGAAGCCGAGGCC 

CAGAGGOVGGAAAGAGCGGTGGGCCT^TGGCCTTAGTGAGGAGAAGCCACrGTCTGTGC 

GAGATGCCCCGTTCCAGCTGGAGACCTGCCCCCTCACGACCGTGGATGCCCTGGTCCTGCGC 

TTCTTCCTGGAGTACCAGTGGTTTGTGGACTTTGCTGTGTACTCGGGCGGCGTGTACCTCTT 

CACAGAGGCCTACTACTACATGCTGGGACCAGCCAAGGAGACTAACATTGCTGTGTTCTGGT 

GCCTGCTCACAGTGACCTTCTCCATCAAGATGTTCCTGACAGTGACACGGCTGTACTTCAGC 

GCCGAGGAGGGGGGTGAGCGCTCTGTCTGCCTCACCTTTGCCTTCCTCTTCCTGCTGCTGGC 
CATGCTGGTGCAAGCG 



t 
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FIGURE 14r> 



GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAATCTTCTTACGGGCCCGTGATTTA 

TTAACGTGGCTTAATCTGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGC 

ATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGGTTTGGGCCCTTGTAGCTGACAGAAGGTGGC 

CAGGGAGAATGCAGCACACTGCTCGGAGAATSAAGGCGCTTCTGTTGCTGGTCTTGCCTTGG 

CTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTG 

TAAAGGTGCCTCCCACTACGGCCTGACCAAAGATAGGAAGAGGCGCTCACAAGATGGCTGTC 

CAGACGGCTGTGCGAGCCTCACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTCCCACC 

ATCTCCTTAATGACAGACGAGCCTGGCCTAGACAACCCTGCCTACGTGTCCTCGGCAGAGGA 

CGGGCAGCCAGCAATCAGCCCAGTGGACTCTGGCCGGAGCAACCGAACTAGGGCACGGCCCT 

TTGAGAGATCCACTATTAGAAGCAGATCATTTAAAAAAATAAATCGAGCTTTGAGTGTTCTT 

CGAAGGACAAAGAGCGGGAGTGCAGTTGCCAACCATGCCGACCAGGGCAGGGAAAATTCTGA 

AAACACCACTGCCCCTGAAGTCTTTCCAAGGTTGTACCACCTGATTCCAGATGGTGAAATTA 

CCAGCATCAAGATCAATCGAGTAGATCCCAGTGAAAGCCTCTCTATTAGGCTGGTGGGAGGT 

AGCGAAACCCCACTGGTCCATATCATTATCCAACACATTTATCGTGATGGGGTGATCGCCAG 

AGACGGCCGGCTACTGCCAGGAGACATCATTCTAAAGGTCAACGGGATGGACATCAGCAATG 

TCCCTCACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTG 

ATGCGTGAACAGAAGTTCCGCAGCAGGAACAATGGACAGGCCCCGGATGCCTACAGACCCCG 

AGATGACAGCTTTCATGTGATTCTCAACAAAAGTAGCCCCGAGGAGCAGCTTGGAATAAAAC 

TGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCGGTGTGGCA 

TATCGACATGGTCAGCTTGAGGAGAATGACCGTGTGTTAGCCATCAATGGACATGATCTTCG 

ATATGGCAGCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCG 

TCGTGTCCCGCCAGGTTCGGCAGCGGAGCCCTGACATCTTTCAGGAAGCCGGCTGGAACAGC 

AATGGCAGCTGGTCCCCAGGGCCAGGGGAGAGGAGCAACACTCCCAAGCCCCTCCATCCTAC 

AATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCGGCATGA 

CCGTCGCAGGGGGAGCATCACATAGAGAATGGGATTTGCCTATCTATGTCATCAGTGTTGAG 

CCCGGAGGAGTCATAAGCAGAGATGGAAGAATAAAAACAGGTGACATTTTGTTGAATGTGGA 

TGGGGTCGAACTGACAGAGGTCAGCCGGAGTGAGGCAGTGGCATTATTGAAAAGAACATCAT 

CCTCGATAGTACTCAAAGCTTTGGAAGTCAAAGAGTATGAGCCCCAGGAAGACTGCAGCAGC 

CCAGCAGCCCTGGACTCCAACCACAACATGGCCCCACCCAGTGACTGGTCCCCATCCTGGGT 

CATGTGGCTGGAATTACCACGGTGCTTGTATAACTGTAAAGATATTGTATTACGAAGAAACA 

CAGCTGGAAGTCTGGGCTTCTGCATTGTAGGAGGTTATGAAGAATACAATGGAAACAAACCT 

TTTTTCATCAAATCCATTGTTGAAGGAACACCAGCATACAATGATGGAAGAATTAGATGTGG 

TGATATTCTTCTTGCTGTCAATGGTAGAAGTACATCAGGAATGATACATGCTTGCTTGGCAA 

GACTGCTGAAAGAACTTAAAGGAAGAATTACTCTAACTATTGTTTCTTGGCCTGGCACTTTT 

TTATAGAATCAATGATGGGTCAGAGGAAAACAGAAAAATCACAAATAGGCTAAGAAGTTGAA 

ACACTATATTTATCTTGTCAGTTTTTATATTTAAAGAAAGAATACATTGTAAAAATGTCAGG 

AAAAGTATGATCATCTAATGAAAGCCAGTTACACCTCAGAAAATATGATTCCAAAAAAATTA 

AAACTACTAGTTTTTTTTCAGTGTGGAGGATTTCTCATTACTCTACAACATTGTTTATATTT 

TTTCTATTCAATAAAAAGCCCTAAAACAACTAAAATGATTGATTTGTATACCCCACTGAATT 

CAAGCTGATTTAAATTTAAAATTTGGTATATGCTGAAGTCTGCCAAGGGTACATTATGGCCA 

TTTTTAATTTACAGCTAAAATATTTTTTAAAATGCATTGCTGAGAAACGTTGCTTTCATCAA 
ACAAGAATAAATATTTTTCAGAAGTTAAA 
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FIGURE 147 
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MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTAT 

APSPEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRS 

FKKINRALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDP 

SESLSIRLVGGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLL 

RQPCQVLWLTVMREQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGV 

FIFNVLDGGVAYRHGQLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRS 

PDIFQEAGWNSNGSWSPGPGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHRE 

WDLPIYVISVEPGGVISRDGRIKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEV 

KEYEPQEDCSSPAALDSNHNMAPPSDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIV 

GGYEEYNGNKPFFIKSIVEGTPAYNDGRIRCGDILLAVNGRSTSGMIHACLARLLKELKGRI 
TLTIVSWPGTFL 
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FIGURE UH 
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CCAAAGTGATCATTTGAAAAAGAGATATCCACATCTTCAAGCCCATATAAAGGATAGAAGCT 

GCACAGGGCAGCTTTACTTACTCCAGCACCTTCCTCTCCCAGGCAA&ISGTGCTGACCATCT 

TTGGGATACAATCTCATGGATACGAGGTTTTTAACATCATCAGCCCAAGCAACAATGGTGGC 

AATGTTCAGGAGACAGTGACAATTGATAATGAAAAAAATACCGCCATCGTTAACATCCATGC 

AGGATCATGCTCTTCTACCACAATTTTTGACTATAAACATGGCTACATTGCATCCAGGGTGC 

TCTCCCGAAGAGCCTGCTTTATCCTGAAGATGGACCATCAGAACATCCCTCCTCTGAACAAT 

CTCCAATGGTACATCTATGAGAAACAGGCTCTGGACAACATGTTCTCCAACAAATACACCTG 

GGTCAAGTACAACCCTCTGGAGTCTCTGATCAAAGACGTGGATTGGTTCCTGCTTGGGTCAC 

CCATTGAGAAACTCTGCAAACATATCCCTTTGTATAAGGGGGAAGTGGTTGAAAACACACAT 

AATGTCGGTGCTGGAGGCTGTGCAAAGGCTGGGCTCCTGGGCATCTTGGGAATTTCAATCTG 

TGCAGACATTCATGTTI^ATGATTAGCCCTCTTGTTTTATCTTTTCAAAGAAATACATCC 

TTGGTTTACACTCAAAAGTCAAATTAAATTCTTTCCCAATGCCCCAACTAATTTTGAGATTC 
AGTCAGAAAATATAAATGCTGTATTTATA 
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FIGURE 14Q 

MKILVAFLVVLTIFGIQSHGYEVFNIISPSNNGGNVQETVTIDNEKNTAIVNIHAGSCSSTT 
IFDYKHGYIASRVLSRRACFILKMDHQNIPPLNNLQWYIYEKQALDNMFSNKYTWVKYNPLE 
SLIKDVDWFLLGSPIEKLCKHIPLYKGEWENTHNVGAGGCAKAGLLGILGISICADIHV 
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FIGURE 150 

GGCACGAGCCAGGAACTAGGAGGrrCTCACTGCCCGAGCAGAGGCCCTACACCCACCGAGGC 
ATGG GGCTCCCTGGGCTGTTCTGCTTGGCCGTGCTGGCTGCCAGCAGCTTCTCCAAGGCACG 
GGAGGAAGAAATTACCCCTGTGGTCTCCATTGCCTACAAAGTCCTGGAAGTTTTCCCCAAAG 
GCCGCTGGGTGCTCATAACCTGCTGTGCACCCCAGCCACCACCGCCCATCACCTATTCCCTC 
TGTGGAACCAAGAACATCT^GGTGGCCAAGAAGGTGGTGAAGACCCACGAGCCGGCCTCCTT 
CAACCTCAACGTCACACTCAAGTCCAGTCCAGACCTGCTCACCTACTTCTGCCGGGCGTCCT 
CCACCTCAGGTGCCCATGTGGACAGTGCCAGGCTACAGATGCACTGGGAGCTGTGGTCCAAG 
CCAGTGTCTGAGCTGCGGGCCAACTTCACTCTGCAGGACAGAGGGGCAGGCCCCAGGGTGGA 
GATGATCTGCCAGGCGTCCTCGGGCAGCCCACCTATCACCAACAGCCTGATCGGGAAGGATG 
GGCAGGTCCACCTGCAGCAGAGACCATGCCACAGGCAGCCTGCCAACTTCTCCTTCCTGCCG 
AGCCAGACATCGGACTGGTTCTGGTGCCAGGCTGCAAACAACGCCAATGTCCAGCACAGCGC 
CCTCACAGTGGTGCCCCCAGGTGGTGACCAGAAGATGGAGGACTGGCAGGGTCCCCTGGAGA 
GCCCCATCCTTGCCTTGCCGCTCTACAGGAGCACCCGCCGTCTGAGTGAAGAGGAGTTTGGG 
GGGTTCAGGATAGGGAATGGGGAGGTCAGAGGACGCAAAGCAGCAGCCATG TAGA ATGAACC 
GTCCAGAGAGCCAAGCACGGCAGAGGACTGCAGGCCATCAGCGTGCACTGTTCGTATTTGGA 
GTTCATGCAAAATGAGTGTGTTTTAGCTGCTCTTGCCACAAAAAAAAAAAAi\AAAAAAAAAA 
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FIGURE 151 

MGLPGLFCLAVLAASSFSKAREEEITPWSIAYKVLEVFPKGRWVLITCCAPQPPPPITYSL 

CGTKNIKVAKKVVKTHEPASFNIJm'LKSSPDIiTYFCRASSTSGAHVDSARI^MHWELWSK 

PVSELRANFTLQDRGAGPRVEMICQASSGSPPITNSLIGKDGQVHLQQRPCHRQPANFSFLP 

SQTSDWFWCQAANNANVQHSALTWPPGGDQKMEDWQGPLESPIIALPLYRSTRRLSEEEFG 
GFRIGNGEVRGRKAAAM 
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FIGURE 152 
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GGTCCTTAATGGCAGCAGCCGCCGCTACCAAGATCCTTCTGTGCCTCCCGCTTCTGCTCCTG 

CTGTCCGGCTGGTCCCGGGCTGGGCGAGCCGACCCTCACTCTCTTTGCTATGACATCACCGT 

CT^TCCCTAAGTTCAGACCTGGACCACGGTGGTGTGCGGTTCAAGGCCAGGTGGATGAAAAGA 

CTTTTCTTCACTATGACTGTGGCAACAAGACAGTCACACCTGTCAGTCCCCTG^ 

CTAAATGTCACAACGGCCTGGAAAGCACAGAACCCAGTACTGAGAGAGGTGGTGGACATACT 

TACAGAGCT^CTGCGTGAOVTTCT^GCTGGAGAATTACACACCCAAGGAACCCCTC^ 

AGGCAAGGATGTCTTGTGAGCAGAAAGCTGAAGGACACAGCAGTGGATCTTGGCAGTTCAGT 

TTCGATGGGCAGATCTTCCTCCTCTTTGACTCAGAGAAGAGAATGTGGACAACGGTTCATCC 

TGGAGCCAGAAAGATGAAAGAAAAGTGGGAGAATGACAAGGTTGTGGCCATGTCCTTCCATT 

ACTTCTCAATGGGAGACTGTATAGGATGGCTTGAGGACTTCTTGATGGGCATGGACAGCACC 

CTGGAGCCAAGTGCAGGAGCACCACTCGCCATGTCCTCAGGCACAACCCAACTCAGGGCCAC 

AGCCACCACCCTCATCCTTTGCTGCCTCCTCATCATCCTCCCCTGCTTCATCCTCCCTGGCA 

TCTGAGGAGAGTCCTTTAGAGTGACAGGTTAAAGCTGATACCAAAAGGCTCCTGTGAGCACG 

GTCTTGATCAAACTCGCCCTTCTGTCTGGCCAGCTGCCCACGACCTACGGTGTATGTCCAGT 

GGCCTCCAGCAGATCATGATGACATCATGGACCCAATAGCTCATTCACTGCCTTGATTCCTT 

TTGCCAACAATTTTACCAGCAGTTATACCTAACATATTATGCAATTTTCTCTTGGTGCTACC 

TGATGGAATTCCTGCACTTAAAGTTCTGGCTGACTAAACAAGATATATCATTTTCTTTCTTC 

TCTTTTTGTTTGGAAAATCAAGTACTTCTTTGAATGATGATCTCTTTCTTGCAAATGATATT 

GTCAGTAAAATAATCACGTTAGACTTCAGACCTCTGGGGATTCTTTCCGTGTCCTGAAAGAG 

AATTTTTAAATTATTTAATAAGAAAAAATTTATATTAATGATTGTTTCCTTTAGTAATTTAT 

TGTTCTGTACTGATATTTAAATAAAGAGTTCTATTTCCCAAAAAAAAAAAAAAAAAA 
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FIGURE 153 

MAAAAATKILLCLPLLLLLSGWSRAGRADPHSLCYD ITVI PKFRPGPRWCAVQGQVDEKTFL 
HYDCGNKTVTPVSPl^KKLNVTTAWKAQNPVLREVVDILTEQLR^ 
MSCEQKAEGHSSGSWQFSFDGQIFLLFDSEKRMWTTVHPGARKMKE^ 
MGDCIGWLEDFLMGMDSTLEPSAGAPLAMSSGTTQLRATATTLILCCLLIILPCFILPGI 
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GGGAAAGCCATTTCGAAAACCCATCTATACAAACTATATATTTTCATTTCTGCTGCTAGCTG 

CCTTGGGCCTCACAATTTTa^TTCTGTTTTCTGACTTTCAAGTTATATACCGTGGAATC^ 

TTGATCCCAACCATAACATCGTGGAGGGTTTTAATTTTGGTGGTAGCCCTCACCCAATTCTG 

GTGTGGCTTTCTTTGCAGAGGATTCCACCTTCAAAATCATGAACTCTGGCTGTTQATCAA7VA 

GAGAATTTGGATTCTACTCTAAAAGTCAATATAGGACTTGGCAAAAGAAGCTAGCAGAAGAC 

TCAACCTGGCCTCCCATAAACAGGACAGATTATTCAGGTGATGGCAAAAATGGATTCTACAT 

CAACGGAGGCTATGAAAGCCATGAACAGATTCCAAAAAGAAAACTCAAATTGGGAGGCCAAC 

CCACAGAACAGCATTTCTGGGCCAGGCTGTAATCAGAATTGTCGTCGTACATGCTCAACAGC 

ATTGCTTTTTTCCCCAAAATTAACACATTGTGGAGAAGTGATGATACTCTCCCCTTACCTTT 

CCTCTCTCCATTCAAGCATTCAAAGTATATTTTCAATGAATTAAACCTTGCAGCAAGGGACC 

TTAGATAGGCTTATTCTGACTGTATGCTTTACCAATGAGAGAAAAAAATGCATTTCCTGTAT 

aiTCCTTTTCAATAAACTGTATTCATTTTGAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 155 



MELIPTITSWRVLILWALTQFWCGFLCRGFHLQNHELWLLIKREFGFYSKSQYRTWQKKLA 
EDSTWPPINRTDYSGDGKNGFYINGGYESHEQIPKRKLKLGGQPTEQHFWARL 
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FIGURE 156 

GTTCTCCTTTCCGAGCCAAAATCCCAGGCGATGGTGAATTATGAACGTGCCACACCATGAAG 
CTCTTGTGGCAGGTAACTGTGCACCACCAC7VCCTGGAATGCCATCCTGCTCCCGTTCGTCTA 
CCTCACGGCGCy^GTGTGGATTCTGTGTGCAGCCATCGCTGCTGCCGCCTCAGCCGGGCCCC 
AGAACTGCCCCTCCGTTTGCTCGTGCAGTAACCAGTTCAGCAAGGTGGTGTGCACGCGCCGG 
GGCCTCTCCGAGGTCCCGCAGGGTATTCCCTCGAACACCCGGTACCTCAACCTCATGGAGAA 
CAACATCCAGATGATCCAGGCCGACACCTTCCGCCACCTCCACCACCTGGAGGTCCTGCAGT 
TGGGCAGGAACTCCATCCGGCAGATTGAGGTGGGGGCCTTCAACGGCCTGGCCAGCCTCAAC 
ACCCTGGAGCTGTTCGACAACTGGCTGACAGTCATCCCTAGCGGGGCCTTTGAATACCTGTC 
CAAGCTGCGGGAGCTCTGGCTTCGCAACAACCCCATCGAAAGCATCCCCTCTTACGCCTTCA 
ACCGGGTGCCCTCCCTCATGCGCCTGGACTTGGGGGAGCTCAAGAAGCTGGAGTATATCTCT 
GAGGGAGCTTTTGAGGGGCTGTTCAACCTCAAGTATCTGAACTTGGGCATGTGCAACATTAA 
AGACATGCCCAATCTCACCCCCCTGGTGGGGCTGGAGGAGCTGGAGATGTCAGGGAACCACT 
TCCCTGAGATCAGGCCTGGCTCCTTCCATGGCCTGAGCTCCCTCAAGAAGCTCTGGGTCATG 
AACTCACAGGTCAGCCTGATTGAGCGGAATGCTTTTGACGGGCTGGCTTCACTTGTGGAACT 
CAACTTGGCCCACa^TAACCTCTCrTCTTTGCCCCATGACCTCTTTACCCCGCTGAGGTACC 
TGGTGGAGTTGCATCTACACCACAACCCTTGGAACTGTGATTGTGACATTCTGTGGCTAGCC 
TGGTGGCTTCGAGAGTATATACCCACCAATTCCACCTGCTGTGGCCGCTGTCATGCTCCCAT 
GCACATGCGAGGCCGCTAGCTCGTGGAGGTGGACCAGGCCTCCTTCCAGTGCTCTGCCCCCT 
TCATCATGGACGCACCTCGAGACCTCAACATTTCTGAGGGTCGGATGGCAGAACTTAAGTGT 
CGGACTCCCCCTATGTCCTCCGTGAAGTGGTTGCTGCCCAATGGGACAGTGCTCAGCCACGC 
CTCCCGCCACCCAAGGATCTCTGTCCTCAACGACGGCACCTTGAACTTTTCCCACGTGCTGC 
TTTCAGACACTGGGGTGTACACATGCATGGTGACCAATGTTGCAGGCAACTCCAACGCCTCG 
GCCTACCTCAATGTGAGCACGGCTGAGCTTAACACCTCCAACTACAGCTTCTTCACCACAGT 
AACAGTGGAGACCACGGAGATCTCGCCTGAGGACACAACGCGAAAGTACAAGCCTGTTCCTA 
CCACGTCCACTGGTTACCAGCCGGCATATACCACCTCTACCACGGTGCTCATTCAGACTACC 
CGTGTGCCCAAGCAGGTGGCAGTACCCGCGACAGACACCACTGACAAGATGCAGACCAGCCT 
GGATGAAGTCATGAAGACCACCAAGATCATCATTGGCTGCTTTGTGGCAGTGACTCTGCTAG 
CTGCCGCCATGTTGATTGTCTTCTATAAACTTCGTAAGCGGCACCAGCAGCGGAGTACAGTC 
ACAGCCGCCCGGACTGTTGAGATAATCCAGGTGGACGAAGACATCCCAGCAGCAACATCCGC 
AGCAGCAACAGCAGCTCCGTCCGGTGTATCAGGTGAGGGGGCAGTAGTGCTGCCCACAATTC 
ATGACCATATTAACTACAACACCTACAAACCAGCACATGGGGCCCACTGGACAGAAAACAGC 
CTGGGGAACTCTCTGCACCCCACAGTCACCACTATCTCTGAACCTTATATAATTCAGACCCA 
TACCAAGGACAAGGTACAGGAAACTCAAATATGACTCCCCTCCCCCAAAAAACTTATAAAAT 
GCAATAGAATGCACACAAAGACAGCAACTTTTGTACAGAGTGGGGAGAGACTTTTTCTTGTA 
TATGCTTATATATTAAGTCTATGGGCTGGTTAAAAAAAACAGATTATATTAAAATTTAAAGA 
CAAAAAGTCAAAACA 
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FIGURE 157 

MKIJjWQVTVHHHTWNAIIJjPFVYLTAQWILCAAIAAAASAG 
RRGLSEVPQGIPSNTRYLNLMENNIQMIQADTFRHLHHI^VLQ^^ 

LNTLELFDNWLTVI PSGAFEYLSKLRELWLRNNP lES I PS YAFNRVPSLMRLDLGELKKLEY 

ISEGAFEGLFNLKYLNLGMCNIKDMPNLTPLVGLEELEMSGNHFPEIRPGSFHGLSSLKKLW 

VMNSQVSLIERNAFDGIASLVELNLAHNNLSSLPHDLFTPLRYLVELH^ 

IiAWWLREYIPTNSTCCGRCHAPMHMRGRYLVEVDQASFQCSAPFIMDAPRDLNISEGRMAE^ 

KCRTPPMSSVKWLLPNGTVLSHASRHPRISVLNDGTLNFSHVLLSDTGVYTCM\n^ 

AS AYLNVSTAELNTSNYS FFTTVTVETTE I SPEDTTRKYKPVPTTSTG YQPAYTTSTTVL I Q 

TTRVPKQVAVPATDTTDKMQTSLDEVMKTTKI I IGCFVAVTLLAAAMLIVFYKLRKRHQQRS 

TVTAARTVE 1 1 QVDED I PAATSAAATAAPSGVSGEGAWLPT IHDH INYNTYKP AHGAHWTE 

NSLGNSLHPTVTTISEPYI IQTHTKDKVQETQI 
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FIGURE 158 

CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTT 

TTCTTCAGCTCCTTCrCATCTCGTCCTTGCCAAGAGAGTACACAGTCATTAATG 

CCTGGAGCAGAGTGGAATATCATGTGTCGGGAGTGCTGTGAATATGATCAGATTGAGTGCGT 

CTGCCCCGGAAAGAGGGAAGTCGTGGGTTATACCATCCCTTGCTGCAGGAATGAGGAGAATG 

AGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGA 

AATGGCTCATGGGGGGGTACCTTGGATGACTTCTATGTGAAGGGGTTCTACTGTGCAGAGTG 

CCGAGCAGGCTGGTACGGAGGAGACTGCATGCGATGTGGCCAGGTTCTGCGAGCCCCAAAGG 

GTCAGATTTTGTTGGAAAGCTATCCCCTA7VATGCTCACTGTGAATGGACCATTCATGCTAAA 

CCTGGGTTTGTCATCCAACTAAGATTTGTCATGTTGAGTCTGGAGTTTGACTACATGTGCCA 

GTATGACTATGTTGAGGTTCGTGATGGAGACAACCGCGATGGCCAGATCATCAAGCGTGTCT 

GTGGCAACGAGCGGCCAGCTCCTATCCAGAGCATAGGATCCTCACTCCACGTCCTCTTCCAC 

TCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGCATGCTC 

CTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTG 

CCTGCTTGGCAGGCTATACTGGGCAGCGCTGTGAAAATCTCCTTGAAGAAAGAAACTGCTCA 

GACCCTGGGGGCCCAGTCAATGGGTACCAGAAAATAACAGGGGGCCCTGGGCTTATCAACGG 

ACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGTTCTTAGTG 

GCAATGAGAAAAGAACTTGCCAGCAGAATGGAGAGTGGTCAGGGAAACAGCCCATCTGCATA 

AAAGCCTGCCGAGAACCAAAGATTTCAGACCTGGTGAGAAGGAGAGTTCTTCCGATGCAGGT 

TCAGTCAAGGGAGACACCATTACACCAGCTATACTCAGCGGCCTTCAGCAAGCAGAAACTGC 

AGAGTGCCCCTACCAAGAAGCCAGCCCTTCCCTTTGGAGATCTGCCCATGGGATACCAACAT 

CTGCATACCCAGCTCCAGTATGAGTGCATCTCACCCTTCTACCGCCGCCTGGGCAGCAGCAG 

GAGGACATGTCTGAGGACTGGGAAGTGGAGTGGGCGGGCACCATCCTGCATCCCTATCTGCG 

GGAAAATTGAGAACATCACTGCTCCAAAGACCCAAGGGTTGCGCTGGCCGTGGCAGGCAGCC 

ATCTACAGGAGGACCAGCGGGGTGCATGACGGCAGCCTACAC/^GGGAGCGTGGTTCCTAGT 

CTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACC 

TGGGGAAGGTCACCATGATCAAGACAGCAGACCTG/y^GTTGTTTTGGGGAAATTCTACCGG 

GATGATGACCGGGATGAGAAGACCATCCAGAGCCTACAGATTTCTGCTATCATTCTGCATCC 

CAACTATGACCCCATCCTGCTTGATGCTGACATCGCCATCCTGAAGCTCCTAGACAAGGCCC 

GTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTC 

CAGGAGTCCCACATCACTGTGGCTGGCTGGAATGTCCTGGCAGACGTGAGGAGCCCTGGCTT 

CAAGAACGACACACTGCGCTCTGGGGTGGTCAGTGTGGTGGACTCGCTGCTGTGTGAGGAGC 

AGCATGAGGACCATGGCATCCCAGTGAGTGTCACTGATAACATGTTCTGTGCCAGCTGGGAA 

CCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCC 

GGGACGAGCATCTCCTGAGCCACGCTGGCATCTGATGGGACTGGTCAGCTGGAGCTATGATA 

AAACATGCAGCCACAGGCTCTCCACTGCCTTCACCAAGGTGCTGCCTTTTAAAGACTGGATT 

GAAAGAAATATGAAATGAACCATGCTCATGCACTCCTTGAGAAGTGTTTCTGTATATCCGTC 

TGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGG 

CTGTGCCAGGGCTTCTGACTTCAGGGACAAAACTCAGTGAAGGGTGAGTAGACCTCCATTGC 

TGGTAGGCTGATGCCGCGTCCACTACTAGGACAGCCAATTGGAAGATGCCAGGGCTTGCAAG 

AAGTAAGTTTCTTCAAAGAAGACCATATACAAAACCTCTCCACTCCACTGACCTGGTGGTCT 

TCCCC/^CTTTCAGTTATACGAATGCCATCAGCTTGACCAGGGAAGATCTGGGCTTCATGAG 

GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGC 

TGGGATGTGGTGCATGCCTTTGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCC 

ATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCTGAACTACAAAAAAAAAAAAAAA 

AAAAAAAAAT^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAT^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 159 

MEUSCWTQLGLTFLQIJjLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKI^ 
GYTI PCCRNEENECDSCLIHPGCTI FENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGD 
OyiRCGQVLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQTO 
GDNRDGQI IKRVCGNERPAP IQS I GSSLHVLFHSDGSKNFDGFHAI YEE ITACSSSPCFHDG 
TCVLDKAGSYKCACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTV 
VS FFCNNS YVLSGNEKRTCQQNGEWSGKQP I C I KACREPKI SDLVRRRVLPMQVQSRETPLH 
QLYSAAFSKQKLQSAPTKKPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGK 
WSGRAPSCIPICGKIENITAPKTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNE 
RTVWAAHCVTDLGKVTM I KTADLK WLGKF YRDDDRDE KTIQSLQISAII LHPNYDP I LLD 
ADIAILKLLDKARISTRVQPICLAASRDLSTSFQESHITVAGVmVIjyDWSPGFKNDTLRS 
WSWDSLLCEEQHEDHGIPVSVTDNMFCASWEPTAPSDICTAETGGIAAVSFPGRASPEPR 
WHLMGLVSWS YDKTCSHRLSTAFTKVLPFKDWI ERNMK 
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ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGA 

AGCTTTCITGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAACAT^ 

TTCAACCTGACTTTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCT 

GACAGTGGTTGGGTGGGCCACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAG 

CAAAGGAGTTCATGGCTAATTTCCATAAGACCCTCATTTTGGGGAAGGGAAAAACTCTGACT 

AATGAAGCATCCACGAAGAAGGTAGAACTTGACAACTGTCCTTCTGTGTCTCCTTACCTCAG 

AGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCACTTTGGAAGAGGTACAGGCAGAAAATC 

CCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAGCTTTACAGAGG^ 

GTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAACATCTGCATCCCTTCCT 

GCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGGTAAAAAGTTTA 

ATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGGACTGC 

TTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGA 

GCATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGAT 

ATTTTGGGGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAAC 

AACTACTGGGGATGGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAAT 

gaaaatttcccggcccctgcctgaagtgggtaaatatacaatggtcttccacactagagaca 
aaggcaatgaggtgaacgcagaacggatgaagctcttacaccaagtgtcacgagtctggaga 
acagatgggttgagtagttgttcttataaattagtatctgtggaacacaatcctttatatat 

CAACATCACAGTGGATTTCTGGTTTGGTGCATGACCCTGGATCTTTTGGTGATGTTTGGAAG 

aactgattctttgtttgcaataattttggcctagagacttcaaatagtagcacacattaaga 

ACCTGTTACAGCTCATTGTTGAGCTGT^TTTTTCCTTTTTGTATTTTCTTAGCAGAGCTCCT 

ggtgatgtagagtataaaacagttgtaacaagacagctttcttagtcattttgatcatgagg 
gttaaatattgtaatatggatacttgaaggactttatataaaaggatgactcaaaggataaa 
atgaacgctatttgaggactctggttgaaggagatttatttaaatttgaagtaatatattat 
gggataaaaggccacaggaaataagactgctgaatgtctgagagaaccagagttgttctcgt 
ccaaggtagaaaggtacgaagatacaatactgttattcatttatcctgtacaatcatctgtg 
aagtggtggtgtcaggtgagaaggcgtccacaaaagaggggagaaaaggcgacgaatcagga 
cacagtgaacttgggaatgaagaggtagcaggagggtggagtgtcggctgcaaaggcagcag 
tagctgagctggttgcaggtgctgatagccttcaggggaggacctgcccaggtatgccttcc 

AGTGATGCCCy^CCAGAGAATACATTCTCTATTAGTTTTTAAAGAGTTTTTGTAAAATGATTT 
TGTAO^GTAGGATATGAATTAGCAGTTTACAAGTTTACATATTAACTAATAATAAATATGT 
CTATCAAATACCTCTGTAGTAAAATGTGAAAAAGCAAAA 
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MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAK^ 

LTNEASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENP^^^^ 

AILVPHRNREKHI^LEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKL^ 

WDCFIFHDVDLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGXrrALSR^^ 

FSNNYWGWGGEDDDLRIJIVELQRMKISRPLPEVGKYTMVFHTRDKGNEV^^ 

VWRTDGLSSCSYKLVSVEHNPLYINITVDFWFGA 

Is^ortant features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites: 

amino acids 4-7, 220-223 and 335-338 

Xylose isomerase proteins: 

amino acids 191-201 
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FIGURE 162 

CGTGGGCCGGGGTCGCGCAGCGGGCTGTGGGCGCGCCCGGAGGAGCGACCGCCGCAGTTCTC 

GAGCTCCAGCTGCATTCCCTCCGCGTCCGCCCCACGCTTCTCCCGCTCCGGGCCCCGCAATG 

GCCCAGGCAGTGTGGTCGCGCCTCGGCCGCATCCTCTGGCTTGCCTGCCTCCTGCCCTGGGC 

CCCGGCAGGGGTGGCCGCAGGCCTGTATGAACTCAATCTCACCACCGATAGCCCTGCCACCA 

CGGGAGCGGTGGTGACCATCTCGGCCAGCCTGGTGGCCAAGGACAACGGCAGCCTGGCCCTG 

CCCGCTGACGCCCACCTCTACCGCTTCCACTGGATCCACACCCCGCTGGTGCTTACTGGCAA 

GATGGAGAAGGGTCTCAGCTCCACCATCCGTGTGGTCGGCCACGTGCCCGGGGAATTCCCGG 

TCTCTGTCTGGGTCACTGCCGCTGACTGCTGGATGTGCCAGCCTGTGGCCAGGGGCTTTGTG 

GTCCTCCCCATCACAGAGTTCCTCGTGGGGGACCTTGTTGTCACCCAGAACACTTCCCTACC 

CTGGCCCAGCTCCTATCTCACTAAGACCGTCCTGAAAGTCTCCTTCCTCCTCCACGACCCGA 

GCAACTTCCTCAAGACCGCCTTGTTTCTCTACAGCTGGGACTTCGGGGACGGGACCCAGATG 

GTGACTGAAGACTCCGTGGTCTATTATAACTATTCCATCATCGGGACCTTCACCGTGAAGCT 

CAAAGTGGTGGCGGAGTGGGAAGAGGTGGAGCCGGATGCCACGAGGGCTGTGAAGCAGAAGA 

CCGGGGACTTCTCCGCCTCGCTGAAGCTGCAGGAAACCCTTCGAGGCATCCAAGTGTTGGGG 

CCCACCCTAATTCAGACCTTCCAAAAGATGACCGTGACCTTGAACTTCCTGGGGAGCCCTCC 

TCTGACTGTGTGCTGGCGTCTCAAGCCTGAGTGCCTCCCGCTGGAGGAAGGGGAGTGCCACC 

CTGTGTCCGTGGCCAGCACAGCGTACAACCTGACCCACACCTTCAGGGACCCTGGGGACTAC 

TGCTTCAGCATCCGGGCCGAGAATATCATCAGCAAGACACATCAGTACCACAAGATCCAGGT 

GTGGCCCTCCAGAATCCAGCCGGCTGTCTTTGCTTTCCCATGTGCTACACTTATCACTGTGA 

TGTTGGCCTTCATCATGTACATGACCCTGCGGAATGCCACTCAGCAAAAGGACATGGTGGAG 

AACCCGGAGCCACCCTCTGGGGTCAGGTGCTGCTGCCAGATGTGCTGTGGGCCTTTCTTGCT 

GGAGACTCCATCTGAGTACCTGGAAATTGTTCGTGAGAACCACGGGCTGCTCCCGCCCCTCT 

ATAAGTCTGTCAAAACTTACACCGTGTGAGCACTCCCCCTCCCCACCCCATCTCAGTGTTAA 

CTGACTGCTGACTTGGAGTTTCCAGCAGGGTGGTGTGCACCACTGACCAGGAGGGGTTCATT 

TGCGTGGGGCTGTTGGCCTGGATCATCCATCCATCTGTACAGTTCAGCCACTGCCACAAGCC 

CCTCCCTCTCTGTCACCCCTGACCCCAGCCATTCACCCATCTGTACAGTCCAGCCACTGACA 

TAAGCCCCACTCGGTTACCACCCCCTTGACCCCCTACCTTTGAAGAGGCTTCGTGCAGGACT 

TTGATGCTTGGGGTGTTCCGTGTTGACTCCTAGGTGGGCCTGGCTGCCCACTGCCCATTCCT 

CTCATATTGGCACATCTGCTGTCCATTGGGGGTTCTCAGTTTCCTCCCCCAGACAGCCCTAC 

CTGTGCCAGAGAGCTAGAAAGAAGGTCATAAAGGGTTAAAAATCCATAACTAAAGGTTGTAC 

ACATAGATGGGCACACTCACAGAGAGAAGTGTGCATGTACACACACCACACACACACACACA 

CACACACACACAGAAATATAAACACATGCGTCACATGGGCATTTCAGATGATCAGCTCTGTA 

TCTGGTTAAGTCGGTTGCTGGGATGCACCCTGCACTAGAGCTGAAAGGAAATTTGACCTCCA 

AGCAGCCCTGACAGGTTCTGGGCCCGGGCCCTCCCTTTGTGCTTTGTCTCTGCAGTTCTTGC 

GCCCTTTATAAGGCCATCCTAGTCCCTGCTGGCTGGCAGGGGCCTGGATGGGGGGCAGGACT 

AATACTGAGTGATTGCAGAGTGCTTTATAAATATCACCTTATTTTATCGAAACCCATCTGTG 

AAACTTTCACTGAGGAAAAGGCCTTGCAGCGGTAGAAGAGGTTGAGTCAAGGCCGGGCGCGG 

TGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACGAGATCAGGA 

GATCGAGACCACCCTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAAAATACAAAAAGTT 

AGCCGGGCGTGGTGGTGGGTGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATG 

GTGCGAACCCGGGAGGCGGAGCTTGCAGTGAGCCCAGATGGCGCCACTGCACTCCAGCCTGA 
GTGACAGAGCGAGACTCTGTCTCCA 
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MAQAVWSRLGRILWLACLLPWAPAGVAAGLYELNLTTDSPATTGAWTISASLVAKDNGSLA 
LPADAHLYRFHWIHTPLVLTGKMEKGLSSTIRWGHVPGEFPVSVWVTAADCWMCQPVARGP 
WLPITEFLVGDLWTQNTSLPWPSSYLTKTVLKVSFLLHDPSNFLKTALFLYSWDFGDGTQ 
MVTEDSVVYYNYSIIGTFTVKLKVVAEWEEVEPDATRAVKQKTGDFSASLKLQETLRGIQVL 
GPTLIQTFQKMTVTLNFLGSPPLTVCWRLKPECLPLEEGECHPVSVASTAYNLTHTFRDPGD 
YCFSIRAENIISKTHQYHKIQWPSRIQPAVFAFPCATLITVMIJ^IMYMTIJWATQQKDM^ 
ENPEPPSGVRCCCQMCCGPPLLETPSEYLEIVRENHGLLPPLYKSVKTYTV 

la^ortant features of the protein: 
Signal peptide: 
amino acids 1-24 



Transmeinbrane domain : 
amino acids 339-362 



N-glyeosylation sites. 

amino acids 34-37, 58-61, 142-145, 197-200, 300-303 and 364-367 
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GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATSGCACTGAGCTCCCAGATCT 
GGGCCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCA 
CAACAGACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTG 
GATGCCCATGTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCT 
GCGGCTGCTGTCATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTG 
CCCCCGTCCCCTCCCTTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAA 
TGGCTGGTTCTTTTGTTTTCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMPQRRRRRDTH 
FPICIFCCGCCHRSKCGMCCKT 
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FIGURE 

CTGTCAGGAAGGACCATCTGAAGGCTGCAATTTGTTCTTAGGGAGGCAGGTGCTGGCCTGGC 

CTGGATCTTCCACCATGTTCCTGTTGCTGCCTTTTGATAGCCTGATTGTCAACCTTCTGGGC 

ATCTCCCTGACTGTCCTCTTCACCCTCCTTCTCGTTTTCATCATAGTGCCAGCCATTTTTGG 

AGTCTCCTTTGGTATCCGCAAACTCTACATGAAAAGTCTGTTAAAAATCTTTGCGTGGGCTA 

CCTTGAGAATGGAGCGAGGAGCCAAGGAGAAGAACCACCAGCTTTACAAGCCCTACACCAAC 

GGAATCATTGCAAAGGATCCCACTTCACTAGAAGAAGAGATCAAAGAGATTCGTCGAiS^ 

TAGTAGTAAGGCTCTGGACAACACTCCAGAGTTCGAGCTCTCTGACATTTTCTACTTTTGCC 

GGAAAGGAATGGAGACCATTATGGATGATGAGGTGACAAAGAGATTCTCAGCAGAAGAACTG 

GAGTCCTGGAACCTGCTGAGCAGAACCAATTATAACTTCCAGTACATCAGCCTTCGGCTCAC 

GGTCCTGTGGGGGTTAGGAGTGCTGATTCGGTACTGCTTTCTGCTGCCGCTCAGGATAGCAC 

TGGCTTTCACAGGGATTAGCCTTCTGGTGGTGGGCACAACTGTGGTGGGATACTTGCCAAAT 

GGGAGGTTTAAGGAATTCATGAGTAAACATGTTCACTTAATGTGTTACCGGATCTGCGTGCG 

AGCGCTGACAGCCATCATCACCTACCATGACAGGGAAAACAGACCAAGAAATGGTGGCATCT 

GTGTGGCCAATCATACCTCACCGATCGATGTGATCATCTTGGCCAGCGATGGCTATTATGCC 

ATGGTGGGTCAAGTGCACGGGGGACTCATGGGTGTGATTCAGAGAGCCATGGTGAAGGCCTG 

CCCACACGTCTGGTTTGAGCGCTCGGAAGTGAAGGATCGCCACCTGGTGGCTAAGAGACTGA 

CTGAACATGTGCAAGATAAAAGCAAGCTGCCTATCCTCATCTTCCCAGAAGGAACCTC^^^ 

AATAATACATCGGTGATGATGTTCAAAAAGGGAAGTTTTGAAATTGGAGCCACAGTTTACCC 

TGTTGCTATCAAGTATGACCCTCAATTTGGCGATGCCTTCTGGAACAGCAGCAAATACGGGA 

tggtgacgtacctgctgcgaatgatgaccagctgggccattgtctgcagcgtgtSg^^^ 

CCTCCCATGACTAGAGAGGCAGATGAAGATGCTGTCCAGTTTGCGAATAGGGTGAAATCTGC 

cattgccaggcagggaggacttgtggacctgctgtgggatgggggcctgaagagggagaagg 

TGAAGGACACGTTCAAGGAGGAGCAGCAGAAGCTGTACAGCAAGATGATCGTGGGGAAC^? 

aaggacaggagccgctcctgagcctgcctccagctggctggggccaccgtgcggggtgccaa 

cgggctcagagctggagttgccgccgccgcccccactgctgtgtcctttccagaSc^^ 

ctccccgggctgctctggatcccaggactccggctttcgccgagccg^^^ 

gcacccggcgcagcctacccttggtggtctaaacggatgctgctgggtgttgcgacccagga 

cgagatgccttgtttcttttacaataagtcgttggaggaatgccattaaagtgSctc^^^^ 

cctttgovcgctgtgcgggctgagtggttggggagatgtggccatggtcttgtgctagSt 

ggcggtacaagagtctgttatgcaagcccgtgtgccagggatgtgctgggggcgScacccg 

ctctccaggaaaggcacagctgaggcactgtggctggcttcggcctcaacatcgcccccagc 

cttggagctctgcagacatgataggaaggaaactgtcatctgcaggggctttcagcaaaatg 

aagggttagatttttatgctgctgctgatggggttactaaagggaggggaagSgcc^^ 

ggccgctgactgggccatggggagaacgtgtgttcgtactccaggctaaccctgaactcc^^ 

atgtgatgcgcgctttgttgaatgtgtgtctcggtttccccatctgtaatatgactSgg^^ 

GAATGGTGGTGATTCCTACCTCACAGGGCTGTTGTGGGGATT^^^ 
CTCAGAGCTGCTGCACTGGGCTTTGGATTTCTTCTTGl^^^ 
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FIGURE 167 
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MFLLLPFDSLIVNLLGISLTVLFTLLLVFIIVPAIFGVSFGIRKLYMKSLLKIFAWATLRME 

RGAKEKNHQLYKPYTNGIIAKDPTSLEEEIKEIRRSGSSKALDNTPEFELSDIFYFCRKGME 

TIMDDEVTKRFSAEELESWNLLSRTNYNFQYISLRLTVLWGLGVLIRYCFLLPLRIALAFTG 

ISIiWGTTWGYLPNGRFKEFMSKHVHLMCYRICVRALTAIITYHDRENRPRNGGICVANH 

TSPIDVIILASDGYYAMVGQVHGGLMGVIQRAMVKACPHVWFERSEVKDRHLVAKRLTEHVQ 

DKSKLPILIFPEGTCINNTSVMMFKKGSFEIGATVYPVAIKYDPQFGDAFWNSSKYGMVTYL 

LRMMTSWAIVCSVWYLPPMTREADEDAVQFANRVKSAIARQGGLVDLLWDGGLKREKVKDTF 
KEEQQKLYSKMIVGNHKDRSRS 
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FIGURE 168 
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GCCCCTCGAAACCAGGACTCCAGCACCTCTGGTCCCGCCCTCACCCGGACCCCTGGCCCTCA 
CGTCTCCTCCAGGGaiSGCGCTGGCGGCTTTGATGATCGCCCTCGGCAGCCTCGGCCTCCAC 
ACCTGGCAGGCCCAGGCTGTTCCCACCATCCrGCCCCTGGGCCTGGCTCCAGACACCTTTGA 
CGATACCTATGTGGGTTGTGCAGAGGAGATGGAGGAGAAGGCAGCCCCCCTGCTAAAGGAGG 
AAATGGCCCACCATGCCCTGCTGCGGGAATCCTGGGAGGCAGCCCAGGAGACCTGGGAGGAC 
AAGCGTCGAGGGCTTACCTTGCCCCCTGGCTTCAAAGCCCAGAATGGAATAGCCATTATGGT 
CTACACCAACTCATCGAACACCTTGTACTGGGAGTTGAATCAGGCCGTGCGGACGGGCGGAG 
GCTCCCGGGAGCTCTACATGAGGCACTTTCCCTTCAAGGCCCTGCATTTCTACCTGATCCGG 
GCCCTGCAGCTGCTGCGAGGCAGTGGGGGCTGCAGCAGGGGACCTGGGGAGGTGGTGTTCCG 
AGGTGTGGGCAGCCTTCGCTTTGAACCCAAGAGGCTGGGGGACTCTGTCCGCTTGGGCCAGT 
TTGCCTCCAGCTCCCTGGATAAGGCAGTGGCCCACAGATTTGGGGAGAAGAGGCGGGGCTGT 
GTGTCTGCGCCAGGGGTGCAGCTAGGGTCACAATCTGAGGGGGCCTCCTCTCTGCCCCCCTG 
GAAGACTCTGCTCTTGGCCCCTGGAGAGTTCCAGCTCTCAGGGGTTGGGCCCTGAAAGTCCA 
ACATCTGCCACTTAGGAGCCCTGGGAACGGGTGACCTTCATATGACGAAGAGGCACCTCCAG 
CAGCCTTGAGAAGCAAGAACATGGTTCCGGACCCAGCCCTAGCAGCCTTCTCCCCAACCAGG 
ATGTTGGCCTGGGGAGGCCACAGCAGGGCTGAGGGAACTCTGCTATGTGATGGGGACTTCCT 
GGGACAAGCAAGGAAAGTACTGAGGCAGCCACTTGATTGAACGGTGTTGCAATGTGGAGACA 
TGGAGTTTTATTGAGGTAGCTACGTGATTAAATGGTATTGCAGTGTGGA 



BNSDoaa <wo taeaaeata i > 



wo 99/63088 / PCT/US99/12252 

FIGURE 169 

MALAALMIALGSLGLHTWQAQAVPTILPLGLAPDTFDDTYVGCy^EMEEKAAPLLKEEMAHH 
ALLRESWEAAQETWEDKRRGLTLPPGFKAQNGIAIMVYTNSSNTLYWELNQAVRTGGGSREL 
YMRHFPFKALHFYLIRALQLLRGSGGCSRGPGEWFRGVGSLRFEPKRLGDSVRLGQFASSS 
LDKAVAHRPGEKRRGCVSAPGVQLGSQSEGASSLPPWKTLLLAPGEPQLSGVGP 
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FIGURE 17n 
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GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCA 

CCCTCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTG 

GTCGGTTCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTC 

TATTGTCTGGACCTTO^CACAACCCCTCTTGTCACCATACAGCCAGAAGGGGGCACTATCA 

TAGTGACCCAAAATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAG 

CTCAGCAAACTGAAGAAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACT 

CCAGCAGCCCTCCACCCAGGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAG 

TCACCATGGGTCTGCAGAGCAATAAGAATGGCACCTGTGTGACCAATCTGACATGCTGCATG 

GAACATGGGGAAGAGGATGTGATTTATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTC 

CCATAATGGGTCCATCCTCCCCATCTCCTGGAGATGGGGAGAAAGTGATATGACCTTCATCT 

GCGTTGCCAGGAACCCTGTCAGCAGAAACTTCTCAAGCCCCATCCTTGCCAGGAAGCTCTGT 

GAAGGTGCTGCTGATGACCCAGATTCCTCCATGGTCCTCCTGTGTCTCCTGTTGGTGCCCCT 

CCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTGGTTTCTGAAGAGAGAGAGACAAGAAG 

AGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGGAAACTCCTAACATATGCCCCCAT 

TCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAATAGAACAATCCTAAAGGAAGA 

TCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAAAGATGGAAAATCCCCACTCAC 

TGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTATCTAGACAGCAGTG 
CACTCCCCTAAGTCTCTGCTCA 
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FIGURE 171 



MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVT 

IQPEGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHV 

YEHLSKPKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRW 

GESDMTFICVARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLW 

FLKRERQEEYIEEKKRVDICRETPNICPHSGEOTEYDTIPHTNRTILKEDPANTVYSTVEIP 
KKMENPHSLLTMPDTPRLFAYENVI 
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FIGURK 172 
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CTGGTTCCCCAACATGCCTCACCCTCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCC 

TCTGGACCCGTGAAAGAGCTGGTCGGTTCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTC 

CAAAGTAAAGCAAGTTGACTCTATTGTCTGGACCTTCAACACAACCCCTCTTGTCACCATAC 

AGCCA6AAGGGGGCACTATCATAGTGACCCAAAATCGTAATAGGGAGAGAGTAGACTTCCCA 

GATGGAGGCTACTCCCTGAAGCTCAGCAAACTQAAGAAGAATGACTCAGGGATCTACTATGT 

GGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCAGGAGTACGTGCTGCATGTCTACG 

AGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCAATAAGAATGGCACCTGTGTG 

ACCAATCTGACATGCTGCATGGAACATGGGGAAGAGGATGTGATTTATACCTGGAAGGCCCT 

GGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTCCTGGAGATGGGGAG 

AAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACTTCTCAAGCCCC 

ATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATGGTCCTCCT 

GTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTGGTTTC 

TGAAGAGAGAGAGACAAGAAGAGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGGAA 

ACTCCTAACATATGCCCCCATTCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAA 

TAGAACAATCCTAAAGGAAGATCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAA 

AGATGGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAG 

AATGTTATCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCAAAAAAAAAAAAAAAAAAA 
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FIGURE 17.1 

GAAAGACGTGGTCCTGACAGACAC3ACAATCCTATTCCCTACCAAAASSAAGATGCTGCTGCT 

GCTGTGTTTGGGACTGACCCTAGTCTGTGTCCATGCAGAAGAAGCTAGTTCTACGGGAAGGA 

ACTTTAATGTAGAAAAGATTAATGGGGAATGGCATACTATTATCCTGGCCTCTGACAAAAGA 

GAAAAGATAGAAGAAa^TGGOACTTTAGACTTTTTCTGGAGCAAATCCATGTCTTGGAGAA 

TTCCTTAGTTCTTAAAGTCCATACTGTAAGAGATGAAGAGTGCTCCGAATTATCTATGGTTG 

CTGACAAAACAGAAAAGGCTGGTGAATATTCTGTGACGTATGATGGATTCAATACATTTACT 

ATACCTAAGACAGACTATGATAACTTTCTTATGGCTCACCTCATTAACGAAAAGGATGGGGA 

AACCTTCCAGCTGATGGGGCTCTATGGCCGAGAACCAGATTTGAGTTCAGACATCAAGGAAA 

GGTTTGCACAACTATGTGAGGAGCATGGAATCCTTAGAGAAAATATCATTGACCTATCCAAT 

GCCAATCGCTGCCTCCAGGCCCGAGAATGAAGAATGGCCTGAGCCTCCAGTGTTGAGTGGAC 

ACTTCTCACCAGGACTCCAC(^TCATCCCTTCCTATCCATACAGCATCCCCAGTATAAATTC 

TGTGATCTGCATTCCATCCTGTCTCACTGAGAAGTCCT^TTCCAGTCTATCAACATGTTACC 

TAGGATACCTCATCAAGAATCAAAGACTTCTTTAAATTTCTCTTTGATACACCCTTGACAAT 

TTTTCATGAAATTATTCCTCTTCCTGTTO^TAAATGATTACCCTTGCACTTAA 
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FIGURE 174 

MKMLLLLCK3LTLVCVHAEEASSTGRNFNVEKINGEWHTIIIJ^DKREKIEEHGNFRLFL 

IHVLENSLVLKVHTVRDEECSELSMVADKTEKAGEYSVTYDGFNTFTIPKTDYDNFLMAHLI 

NEKDGETFQLMGLYGREPDLSSDIKERFAQLCEEHGILRENIIDLSNANRCLQARE 
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FIGURE 175 

GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAA 

TGGATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTG 

TCAGCTTAGTTGAGGAA6ACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTC 

CCAGG/^TTATAGGAGCAGGTCTGATGGCCATTCOIGCAACAACAATGTCCTTGACAGCAAG 

AAAAAGAGCGTGCTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTC 

CAGTCATTGGTGCTCTGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTC 

ATGTGTAATTCTCCT^GCAACAGTAATGCCAATTGTGAATTTTCATTGAAAAACATCAGTGA 

CATTCATCCAGAATCCTTCAACTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTG 

GTTTCAATAAACCCACCAGTAACGACACCATGGCGAGTGGCTGGAGAGCATCTAGTTTCCAC 

TTCGATTCTGAAGAAAACAAACATAGGCTTATCCACTTCTCAGTATTTTTAGGTCTATTGCT 

TGTTGGAATTCTGGAGGTCCTGTTTGGGCTCAGTCAGATAGTCATCGGTTTCCTTGGCTGTC 

TGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTGTGTAGTTTAATGGGAATAAAATGTAAGTA 

TCAGTAGTTTGAAAAAAAT^AAA 
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FIGURE 176 



OTCCEGWTSCNGFSLLVLLLLGVVLNAI PL I VSLVEEDQFSQNP ISCFEWWFPG 1 1 GAGLMA 
IPATTMSLTARKRACCNNRTGMFLSSFFSVITVIGALYCMLISIQAIJjKGPLMCNSPSNSNA 
NCEFSLKNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRL 
IHFSVFLGLIiLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 
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FIGURE 177 

GTCGAATCCAAATCACTCATTGTGAAAGCTGAGCTCACAGCCGAATAAGCCACCATGAGGCT 

GTCAGTGTGTCTCCTGATGGTCrCGCTGGCCCTTTGCTGCTACCAGGCCCATGCTCTTGTCT 

GCCCAGCTGTTGCTTCTGAGATCACAGTCTTCITATTCTTAAGTGACGCTGCGGTAAACCTC 

CAAGTTGCCAAACTTAATCCACCTCCAGAAGCTCTTGCAGCCAAGTTGGAAGTGAAGCACTG 

OVCCGATCAGATATCTTTTAAGAAACGACTCTCATTGAj^AAAGTCCTGGTGGAA ATAG TGAA 

AAAATGTGGTGTGTGACATGTAAAAATGCTCAACCTGGTTTCCAAAGTCTTTC^ 

CTGATCTTCACTAAAAATTGTAAAGGTTTCAACACGTTGCTTTAATAAATCACTTGCCCTGC 



wo 99/63088 

^ . I PCTAJS99/12252 

FIGURE 17« 

MRLSVCIilWSIJO-CCYQAHALVCPAVASEIWFLFLSDAAVNLQVAiajJPPPEAIJAKLEV 
KHCTDQISFKKRLSLKKSWWK 
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FIGURE 17Q 



ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGG 

TGAAGGAGCAGTGAGGAACCTGCAGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTG 

GAGCAGATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTC 

GAACTGTGAC&ISQAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGG 

AAGCCAATGACCCATTTGCCAATAAA6ACGATCCCTTCTACTATGACTGGAAAAACCTGCAG 

CTGAGCGGACTGATCTGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGG 

CAAATGCAAATACAAGAGCAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCAC 

TCATCACTCCAGGCTCTGCCACTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGA 

AGCCTAACACTGGCCCCCAGCACCTCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTC 

TCTCCAAGGGCAGGCTGTTAGGCCCCTTTCTGATCAGGAGGCTTCTTTATGAATTAAACTCG 
CCCCACCACCCCCTCA 
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FIGURE 180 

MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCK 
YKSSQKQHSPVPEKAI PLITPGSATTC 
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FIGURE 181 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCC 

TGGCTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTC 

CTGGCTACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCC 

AGTGTTTCCGACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCT 

ACAGAGGAGGGCTT'GAAGGACrCGACCCAGATGTCGGCCACCTATTCCCAGGGCTT^ 

ATGGCTGGGTCCCATCATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCA 

CCAATGCCTCAGCTGCCATTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGG 

CTGGGAGAAGGGATACTGCTGAGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGAC 

GCCCGCCTTCCATTTCAACATCCTGAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACA 

TCATGCTTGACAAGTGGCAGCACCTGGCCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAG 

CACATCAGCCTCATGACCTTGGACAGTCTACAGAAATGCATCTTCAGCTTTGACAGCCATTG 

TCAGGAGAGGCCCAGTGAATATATTGCCACCATCTTGGAGCTCAGTGCCCTTGTAGAGAAAA 

GAAGCCAGCATATCCTCCAGCACATGGACTTTCTGTATTACCTCTCCCATGACGGGCGGCGC 

TTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGACGCTGTCATCCGGGAGCGGCGTCG 

CACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAAAGCCAAGTCCAAGACTTTGG 

ATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGGCATTGTCAGATGAGGAT 

ATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACGGCCAGTGGCCTCTC 

CTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCGACAGGAGGTGC 

AAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCCAGCTGCCC 

TTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATCTCCCG 

ATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTGCC 

TCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGAC 

CCCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTC 

cgcagggcccaggaactgcatcgggcaggcgttcgccatggcggagatgaaagtggtcctgg 
cgttgatgctgctgcacttccggttcctgccagaccacactgagccccgcaggaagctggaa 
ttgatcatgcgcgccgagggcgggctttggctgcgggtggagcccctgaatgtaggcttgca 
gtgactttctgacccatccacctgtttttttgcagattgtcatgaataa;^ 
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FIGURE 182 
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MSLLSLPWLGLRPVAMSPWLLLLLWGSWLLARILAWTYAFYNNCRRLQCFPQPPKRNWFWG 
HLGLITPTEEGLKDSTQMSATYSQGFTVWLGPI I PFIVLCHPDTIRSITNASAAIAPKDNLF 
IRFLKPWLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGS 
SRLDMFEHISLMTLDSIiQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHII^HMDFLYY 
LSHDGRRFHRACRLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDG 
KALSDEDI RAEADTFMFGGHDTTASGLSWVLYNIiARHPEYQERCRQEVQELLKDRDPKE I EW 
DDLAQLPFLTMCVKESLRLHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVW 
PDPEVYDPFRFDPENSKGRSPLAFIPFSAGPRNCIGQAFAMAEMKVVIALMLLHFRFLPDHT 
EPRRKLELIMRAEGGLWLRVEPLNVGLQ 
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FIGURE 183 

CAACAGAAGCCAAGAAGGAAGCCGTCTATCTTGTGGCGAT CATG TATAAGCTGGCCTCCTGC 

TGTTTGCrTTTCAa^GGATTCTTAAATCCTCTCTTATCTCTTCCTCTCCTTGACTCCAG^ 

AATATCCTTTCAACTCTCAGCACCTCATGAAGACGCGCGCTTAACTCCGGAGGAGCTAGAAA 

GAGCTTCCCTTCTACAGATATTGCCAGAGATGCTGGGTGCAGAAAGAGGGGATATTCTCAGG 

AAAGCAGACTCAAGTACCAACATTTTTAACCCAAGAGGAAATTTGAGAAAGTTTCAGGATTT 

CTCTGGACAAGATCCTAACATTTTACTGAGTCATCTTTTGGCCAGAATCTGGAT^CCATACA 

AGAAACGTGAGACTCCTGATTGCTTCTGGAAATACTGTGTCTGAAGTGAAATAAGCATCTGT 

TAGTCAGCTCAGAAACACCCATCTTAGAATATGAAAAATAACACAATGCTTGATTTGAAAAC 

AGTGTGGAGAAAAACTAGGCAAACTACACCCTGTTCATTGTTACCTGGAAAATAAATCCTCT 

ATGTTTTGCACAAAAAAAAAAAAAAA 
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FIGURE 184 

MYKLAS CCLLFTGFLNPLLSLPLLDSRE I S FQLS APHEDARLTPEELERASLLQ I L PEMLGA 
ERGDILRKADSSTNIFNPRGNLRKFQDFSGQDPNILLSHLLARIWKPYKKRETPDCFWK^ 
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FIGURE 185 

GAACATTTTTAGTTCCCAAGGAATGTACATCAGCCCCACGGAAGCTAGGCCACCTCTGGGAT 
GGGGTTGCTGGTTTAAAACAAACGCCAGTCATCCTATATAAGGACCTGACAGCCACCAGGCA 
CCACCTCCGCCAGGAACTGCAGGCCCACCTGTCTGCAACCCAGCTGAGGC CATG CCCTCCCC 
AGGGACCGTCTGCAGCCTCCTGCTCCTCGGCATGCTCTGGCTGGACTTGGCCATGGCAGGCT 
CCAGCTTCCTGAGCCCTGAACACCAGAGAGTCCAGCAGAGAAAGGAGTCGAAGAAGCCACC^ 
GCCAAGCTGCAGCCCCGAGCTCTAGCAGGCTGGCTCCG.CCCGGAAGATGGAGGTCAAGCAGA 
AGGGGCAGAGGATGAACTGGAAGTCCGGTTCAACGCCCCCTTTGATGTTGGAATCAAGCTGT 
CAGGGGTTCAGTACCAGCAGCACAGCCAGGCCCTGGGGAAGTTTCTTCAGGACATCCTCTGG 
GAAGAGGCCAAAGAGGCCCCAGCCGACAAG TGA TCGCCCACAAGCCTTACTCACCTCTCTCT 
AAGTTTAGAAGCGCTCATCTGGCTTTTCGCTTGCTTCTGCAGCAACTCCCACGACTGTTGTA 
CAAGCTCAGGAGGCGAATAAATGTTCAAACTGTA 
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FIGURE 186 

MPSPGTVCSLLLLGMLWLDIJ^GSSFLSPEHQRVQQRKESKKPPAKLQPRALAGWLRPEDG 
GQAEGAEDELEVRFNAPFDVGIKLSGVQYQQHSQALGKFLQDILWEEAKEAPADKO 
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FIGURE 187 

CGGCCACAGCTGGCATGCTCTGCCTGATCGCCATCCTGCTGTATGTCCTCGTCCAGTACCTC 
GTGAACCCCGGGGTGCTCCGCACGGACCCCAGATGTCAAGAATATGAACACGTGGCTGCTGT 
TCCTCCCCCTGTTCCCGGTGCAGGTGCAGACCCTGATAGTCGTGATCATCGGGATGCTCGTG 
OTCCTGCTGGACTTTCTTGGCTTGGTGCACCTGGGCCAGCTGCTCATCTTCCACATCTACCT 
GAGTATGTCCCCCACCCTAAGCCCCCGATCCCCCCAAGGCTGGGTGGTCAGAGCTGCTCATC 
TTACACCTCTACTTGAGTATGTCCCTAACCCTGAGCCCCCCACGCCTGGGGCCAGAGTCTTT 
GTCCCCCGTGTGCGCATGTGTTCAGGGTCAGCCTCTCCCAGAAGTGAGATCATGGACAAAAA 
GGGCAAATCACAGGAAGAAATTAAATCCATGAGGACCCAGCAGGCCCAGCAAGAAGCTGAAC 
TCACGCCGAGACCTGCAGGAGTGGTGCCAGGTGCTTGAAGTAACAAGTTTAAAATGTTCAGA 
GACAATGGAATGGAATCTATTAGGCAAGAACAGGACATTATGAAATAAGGACAGGTGGACTT 
CCAAAAACACAAGTAGAAATTCTAACAATGAAATATATTACAGGCAGGTCACCCACTAACCA 
AACAACTGAAGCGAGAGCTGTGGTCTTGCTTGGTCTCACAGTGGGCACAGCGGTAGGCGGTC 
AGTCATGTTGCTGAACGACGGAGGGTAAACTCCCCAGCCCCAAGAAAACCTGTGTTGGAAGT 
AACAACAACCTCCCTGCTCCTGGCACCAGCCGTTTTGGTCATGGTGGGCCAGCTGCAAAGCG 
TCTTCCATTCTCTGGGCAGTGGTGGCCCCGAGGCTGTGGCCrrCTCAGGGGGTTTCTGTGGAC 
ACGGGCAGCAGAGTGTGTCCAGGCCAGCCCCCAAGAATGCCCTGCTCCTGACAGCTTGGCCA 
ACCCCTGGTCAGGGCAGAGGGAGTTGGGTGGGTCAGGCTCTGGGCTCACCTCCATCTCCAGA 
GCATCCCCTGCCTGCAGTTGTGGCAAGAACGCCCAGCTCAGAATGAACACACCCCACCAAGA 
GCCTCCTTGTTCATAACCACAGGTTACCCTACAAACCACTGTCCCCACACAACCCTGGGGAT 
GTTTTAAAACACACACCTCTAACGCATATCTTACAGTCACTGTTGTCTTGCCTGAGGGTTGA 
ATTTTTTTTAATGAAAGTGCAATGAAAATCACTGGATTAAATCCTACGGACACAGAGCTGAA 
AAAAAAAAT^AAAAAAAAAAAAAAAAAAAAA 
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FIGURE 188 

MNTWLLFLPLFPVQVQTLIWIIGMLVLLLDFLGLVHLGQLLIFHIYLSMSPTLSPRSPQGW 
VVRAAHLTPLLEYVPNPEPPTPGARVFVPRVRMCSGSASPRSE IMDKKGKSQEE IKSMRTQQ 
AQQEAELTPRPAGWPGA 
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FIGURE 189 

GGAGTGCAGATGGCATCCTTCGGTTCTTCCAGACAAGCTGCAAGACGCTGACCATG^ 

ATGGAGCTCTCGAAGGCCTTCTCTGGCCAGCGGACACTCCTATCTGCCATCCTCAGC^^ 

ATCACTCAGCTTCTCCACAACATCCCTGCTCAGCAACTACTGGTTTGTG^ 

TGCCCAAGCCCCTGTGCGAGAAAGGTCTGGCAGCCAAGTGCTTTGACATGCCAGTGTCCCTG 

GATGGAGATACCAACACATCCACCCAGGAGGTGGTACAATACAACTGGGAGACTGGGGATGA 

CCGGTTCTCCTTCCGGAGCTTCCGGAGTGGCATGTGGCTATCCTGTGAGGAAACTGTGGAAG 

AACCAGGGGAGAGGTGCCGAAGTTTCATTGAACTTACACCACCAGCCAAGAGAGGTGAGAAA 

GGACTACTGGAATTTGCCACGTTGCAAGGCCCATGTCACCCCACTCTCCGATTTGGAGGGAA 

GCGGTTGATGGAGAAGGCTTCCCTCCCCTCCCCTCCCTTGGGGCTTTGTGGCAAAAATCCTA 

TGGTTATCCCTGGGAACGCAGATCACCTACATCGGACTTCAATTCATCAGCTTCCTCCTGCT 

ACTAACAGACTTGCTACTCACTGGGAACCCTGCCTGTGGGCTCAAACTGAGCGCCTTTGCTG 

CTGTTTCCTCTGTCCTGTCAGGTCTCCTGGGGATGGTGGCCCACATGATGTATTCACAAGTC 

TTCCAAGCGACTGTCAACTTGGGTCCAGAAGACTGGAGACCACATGTTTGGAATTATGGCTG 

GGCCTTCTACATGGCCTGGCTCTCCTTCACCTGCTGCATGGCGTCGGCTGTCACCACCTTCA 

ACACGTACACCAGGATGGTGCTGGAGTTCAAGTGCAAGCATAGTAAGAGCTTCAAGGAAAAC 

CCGAACTGCCTACCACATCACCATCAGTGTTTCCCTCGGCGGCTGTCAAGTGCAGCCCCCAC 

CGTGGGTCCTTTGACCAGCTACCACCAGTATCATAATCAGCCCATCCACTCTGTCTCTGAGG 

GAGTCGACTTCTACTCCGAGCTGCGGAACAAGGGATTTCAAAGAGGGGCCAGCCAGGAGCTG 

Ty^GAAGCAGTTAGGTCATCTGTAGAGGAAGAGCAGTGTTAGGAGTTAAGCGGGTTTGGGGA 

GTAGGCTtGAGCCCTACCTTACACGTCTGCTGATTATCAACATGTGCTTAAGCCAACATCCG 

TCTCTTGAGCATGGTTTTTAGAGGCTACGAATAAGGCTATGAATAAGGGTTATCTTTAAGTC 

CTAAGGGATTCCTGGGTGCCACTGCTCTCTTTTCCTCTACAGCTCCATCTTGTTTCACCCAC 

CCCACATCTCACACATCCAGAATTCCCTTCTTTACTGATAGTTTCTGTGCCAGGTTCTGGGC 

TA/^CCATGGAGATAAAAAGAAGAGTAAAATACACTTCCCGACCTTAAGGATCTGAAA 
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FIGURE 190 

MAKMELSKAFSGQRTLLSAILSMLSLSFSTTSLLSNYWFVGTQKVPKPLCEKGLAAKCFDMP 
VSLDGDTNTSTQEWQYNWETGDDRFSFRSFRSGMWLSCEETVEEPGERCRSFIELTPPAKR 
GEKGLLEFATLQGPCHPTLRFGGKRLMEKASLPSPPLGLCGKNPMVIPGNADHLHRTSIHQL 
PPATNRLATHWEPCLWAQTERLCCCFLCPVRSPGDGGPHDVFTSLPSDCQLGSRRLETTCLE 
LWLGLLHGLALLHLLHGVGCHHLQHVHQDGAGVQVQA 
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FIGURE 191 

AACTGGAAGGAAAGAAAGAAAGGTCAGCTTTGGCCCAGATGTGGTTACCCCTTGGTCTCCTG 

TCTTTATGTCTTTCTCCTCTTCCTATTCTGTCATCTCCCTCACTTAA^ 

GCAGCTCCTGTGGACy^TTGCOVTCCCCTCTGGTAGCCTTCAGAGCAAACAGG 

TTATGGATGTTTCCACCAACCAGGGTAGTGGCATGGAGCACCGTAACCATCTGTGCT^ 

GATCTCTATGACAGAGCCACTTCTCCACCTCTGAAATGTTCCCTGCTCTGAAATCTGGCATG 

AGATGGCACAGGTGACCACGO^GAAGCCACCAGAATCTTGCCTGCCCTATTCCTCCTCCCAA 

GTCTGTTCTCTTATTGTCAACCTCAGCACAACAGGCTGGCGCCAATGGCATTACAGAGAAAG 

CAATCTGTGTGGCTAGTGGGCAGATTACCATGCAAGCCCCAGGAGAAATGGAGGAGCTTTGT 

AGCCACCTCCCTGTCAGCCAGTATTAACATGTCCCCTTCCCCCTGCCCCGCCGTAGATTCAG 

GACATTCGCCCCTGTGTGCCACCAAACCAGGACTTTCCCCTTGGCTTGGCATCCCTGGCTCT 

CTCCTGGTACCCAGCAAGACGTCTGTTCCAGGGCAGTGTAGCATCTTTCAAGCTCCGTTACT 

ATGGCGATGGCCATGATGTTACAATCCCACTTGCCTGAATAATCAAGTGGGAAGGGGAAGCA 

GAGGGAAATGGGGCCATGTGAATGCAGCTGCTCTGTTCTCCCTACCCTGAGGAAAAACCAAA 

GGGAAGCAACAGGAACTTCTGCAACTGGTTTTTATCGGAAAGATCATCCTGCCTGCAGATGC 

TGTTGAAGGGGCACAAGAAATGTAGCTGGAGAAGATTGATGAAAGTGCAGGTGTGTAAGGAA 

ATAGAACAGTCTGCTGGGAGTCAGACCTGGAATTCTGATTCCAAACTCTTTATTACTTTGGG 

AAGTCACTCAGCCTCCCCGTAGCCATCTCCAGGGTGACGGAACCCAGTGTATTACCTGCTGG 

AACCAAGGAAACTAACAATGTAGGTTACTAGTGAATACCCCAATGGTTTCTCCAATTATGCC 

CATGCCACCAAAACAATAAAACAAAATTCTCTAACACTGAAA 
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MWLPLGLLSLCLSPLPILSSPSLKSQACQQLLWTLPSPLVAFRANRTTYVMDVSTNQGSGME 
HRNHLCFCDLYDRATSPPLKCSLL 
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GTAGCGCGTCTTGGGTCTCCCGGCTGCCGCTGCTGCCGCCGCCGCCTCGGGTCGTGGAGCCA 

GGAGCGACGTCACCGCCATGGCAGGCATCAAAGCTTTGATTAGTTTGTCCTTTGGAGGAGCA 

ATCGGACTGATGTTTTTGATGCTTGGATGTGCCCTTCCAATATACAACAAATACTGGC 

CTTTGTTCTATTTTTTTACATCCTTTCACCTATTCCATACTGCATAGCAAGAAGATTAGTGG 

ATGATACAGATGCTATGAGTAACGCTTGTAAGGAACTTGCCATCTTTCTTACAACGGGCATT 

GTCGTGTCAGCTTTTGGACTCCCTATTGTATTTGCCAGAGCACATCTGATTGAGTGGGGAGC 

TTGTGCACTTGTTCTCACAGGAAACACAGTCATCTTTGCAACTATACTAGGCTTTTTCTTGG 

TCTTTGGAAGCAATGACGACTTCAGCTGGCAGCAGTGGTGAA?^GAAATTACTGAACTATTG 

TCAAATGGACTTCCTGTCATTTGTTGGCCATTCACGCACACAGGAGATGGGGCAGTTAATGC 

TGAATGGTATAGCAAGCCTCTTGGGGGTATTTTAGGTGCTCCCTTCTCACTTTTATTGTAAG 

CATACTATTTTCACAGAGACTTGCTGAAGGATTAAAAGGATTTTCTCTTTTGGAAAAGCTTG 

ACTGATTTCACACTTATCTATAGTATGCTTTTTGTGGTGTCCTGCTGAATTTAAATATTTAT 

GTGTTTTTCCTGTTAGGTTGATTTTTTTTGGAATCAATATGCAATGTTAAACACTTTTTTAA 

TGTAATCATTTGCATTGGTTAGGAATTCAGAATTCCGCCGGCTCTATTACTGGTCAAGTACA 

TCTTTTCTCTTAAAATTATTTAGCCTCCATTATTACAAAAAATTATAAAAATAAGTTTTCAG 

TCAGTCAGGATGACATCACTCCCAATGTTATGCAGACATACAGACGGTTGGCATACGTTATA 

GACTGTATACTCAGTGCAAATATAGCTGCATTTATACCTCAGAGGGGCCAAGTGTTAATGCC 

CATGCCCTCCGTTAAGGGTTGTTGGTTTTACTGGTAGACAGATGTTTTGTGGATTGAAAATT 

ATTTTATGGAATTGCTACAGAGGAGTGCTTTTCTTCTCAATTGTTAGAAGAATTTATGTTAA 

ACTTTAAGGTAAGGGTGTAAAAACATTTTTGAGATAAGGTTTTTATTTATGTTTATTATTGT 

TAGAGTGAGTTGCAATGTGGGAAGAAATGACATTGAAATTCCAGTTTTTGAATCCTGTTTCT 

ATTTATAAGTGAAATTTGTGATCTCCTATCAACCTTTCATGTTTTACCCTGTTAT^TGGAC 

ATACATGGAACCACTACTGATGAGGGACAGTTGTATGTTTGCATCATATATGCCAGAAAACC 

TTCCTCTGCTTCCTCCTTTTGACTTATTTGGTATGTTGTATATATTACATAAAATAACTTTT 

CAAATATAGTTTAATAACACTTAGAAGTGTTTACTTACCTGGAAAATAATTGCTATGCCGTA 

CATTCAGAGTGCCCCCTCCCCTGCAAGGCCTTGCCATGATTAACAAGTAACTTGTTAGTCTT 

ACAGATAATTCATGCATTAACAGTTTAAGATTTAGACCATGGTAATAGTAGTTCTTATTCTC 

TAAGGTTATATCATATGTAATTTAAAAGTATTTTTAAGACAAGTTTCCTGTATACCTCTGAA 

CTGTTTTGATTTTGAGTTCATCATGATAGATCTGCTGTTTCCTTATAAAAGGCATTTGTTGT 

GTGAGTTAATGCAAAGTAGCCAAGTCCAGCTATATAGCAGCTTCAGAAACATACCTGACCAA 

AAAATTCCCAGTAACCAGGCATGATCAATTTATAGTGGTCGTTTACATCTAATAATTATCAG 

GACTTTTTTCAGGAGTGGGTTATAAAAACATTCAAGTTGGTCTGACAGTATTTTGTTAAGGA 

TATTTGTTTGTATGTTTATTCAGTATACTTACATAAAAATTATTTCGCCATCAGCCAT^CT 

CAGTAATCATGACAGCTGTCTGTTGTTTTATGAAGTTTATTTCTCAAGAAAATGGGAATAAA 

TTTGGGATTTGTTCAGCTTTTTTACTAAAGATGCCTAT^GCCACAGGTTTTATTGCCTAACT 

TAAGCCATGACTTTTAGATATGAGATGACGGGAAGCAGGACGAAATATCGGCGTGTGGCTGG 

AGCCTTCCCACTGGAGGCTGAAAGTGGCTTGTGGTATTATAATGTTCAGATTTCAAGAGGAA 

GGTGCAGGTACACATGAGTTAGAGAGCTGGTGAGACAGTTGGGAACTCTTTGTGCTTGTGAT 

CTACTGGACTTTTTTTTTGCAGGAAGTGCATTCTCTGGTCCTTCCCTATTTTCTGTTCTGGA 

TGTCAGTGCAGTGCACTGCTACTGTTTTATCCACTTGGCCACAGACTTTTTCTAACAGCTGC 

GTATTATTTCTATATACTAATTGCATTGGCAGCATTGTGTCTTTGACCTTGTATACTAGCTT 

GACATAGTGCTGTCTCTGATTTCTAGGCTAGTTACTTGAGATATGAATTTTCCATAGT^TAT 

GCACTGATACAACATTACCATTCTTCTATGGAAAGAAAACTTTTGATGATGAAACAATAAAG 

ATTTTAAATATCTATTTTAAAAAAAAAA 
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MAGI KALI SLSFGGAIGIJ^FLMI/5CALP I YNKYWPLFVLFFY I LS P I PYC I ARRLVDDTD^ 
SNACKELAIFLTTGIWSAFGLPIVFARAHLIEWGACALVLTGNTVIFATILGFFLVFGSND 
DFSWQQW 
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FIGURE 195A 

CCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCCCACGCGTCCGCC 

CACGCGTCCGCCCACGCGTCCGGTGCAAGCTCGCGCCGCACACTGCCTGGTGGAGGGAAGGA 

GCCCGGGCGCCTCTCGCCGCTCCCCGCGCCGCCGTCCGCACCTCCCCACCGCCCGCCGCCCG 

CCGCCCGCCGCCCGCAAAGCATGAGTGAGCCCGCTCTCTGCAGCTGCCCGGGGCGCGAATGG 

CAGGCTGTTTCCGCGGAGTAAAAGGTGGCGCCGGTCAGTGGTCGTTTCCAATGACGGACATT 

AACaVGACTGTCAGATCCTGGGGAGTCGCGAGCCCCGAGTTTGGAGTTTTTTCCCCCCACAA 

CGTCACAGTCCGAACTGCAGAGGGAAAGGAAGGCGGCAGGAAGGCGAAGCTCGGGCTCCGGC 

ACGTAGTTGGGAAACTTGCGGGTCCTAGAAGTCGCCTCCCCGCCTTGCCGGCCGCCCTTGCA 

GCCCCGAGCCGAGCAGCAAAGTGAGACATTGTGCGCCTGCCAGATCCGCCGGCCGCGGACCG 

GGGCTGCCTCGGAAACACAGAGGGGTCTTCTCTCGCCCTGCATATAATTAGCCTGCACACAA 

AGGGAGCAGCTGAATGGAGGTTGTCACTCTCTGGAAAAGGATTTCTGACCGAGCGCTTCCAA 

TGGACATTCTCCAGTCTCTCTGGAAAGATTCTCGCTAATGGATTTCCTGCTGCTCGGTCTCT 

GTCTATACTGGCTGCTGAGGAGGCCCTCGGGGGTGGTCTTGTGTCTGCTGGGGGCCTGCTTT 

CAGATGCTGCCCGCCGCCCCCAGCGGGTGCCCGCAGCTGTGCCGGTGCGAGGGGCGGCTGCT 

GTACTGCGAGGCGCTCAACCTCACCGAGGCGCCCCACAACCTGTCCGGCCTGCTGGGCTTGT 

CCCTGCGCTACAACAGCCTCTCGGAGCTGCGCGCCGGCCAGTTCACGGGGTTAATGCAGCTC 

ACGTGGCTCTATCTGGATCACAATCACATCTGCTCCGTGCAGGGGGACGCCTTTCAGAAACT 

GCGCCGAGTTAAGGAACTCACGCTGAGTTCCAACCAGATCACCCAACTGCCCAACACCACCT 

TCCGGCCCATGCCCAACCTGCGCAGCGTGGACCTCTCGTACAAC/y^GCTGCAGGCGCTCGCG 

CCCGACCTCTTCCACGGGCTGCGGAAGCTCACCACGCTGCATATGCGGGCCAACGCCATCCA 

GTTTGTGCCCGTGCGCATCTTCCAGGACTGCCGCAGCCTCAAGTTTCTCGACATCGGATACA 

ATCAGCTCAAGAGTCTGGCGCGCAACTCTTTCGCCGGCTTGTTTAAGCTCACCGAGCTGCAC 

CTCGAGCACAACGACTTGGTCAAGGTGAACTTCGCCCACTTCCCGCGCCTCATCTCCCTGCA 

CTCGCTCTGCCTGCGGAGGAACAAGGTGGCCATTGTGGTCAGCTCGCTGGACTGGGTTTGGA 

ACCTGGAGAAAATGGACTTGTCGGGCAACGAGATCGAGTACATGGAGCCCCATGTGTTCGAG 

ACCGTGCCGCACCTGCAGTCCCTGCAGCTGGACTCCAACCGCCTCACCTACATCGAGCCCCG 

GATCCTCAACTCTTGGAAGTCCCTGACAAGCATCACCCTGGCCGGGAACCTGTGGGATTGCG 

GGCGCAACGTGTGTGCCCTAGCCTCGTGGCTCAGCAACTTCCAGGGGCGCTACGATGGCAAC 

TTGCAGTGCGCCAGCCCGGAGTACGCACAGGGCGAGGACGTCCTGGACGCCGTGTACGCCTT 

CCACCTGTGCGAGGATGGGGCCGAGCCCACCAGCGGCCACCTGCTCTCGGCCGTCACCAACC 

GCAGTGATCTGGGGCCCCCTGCCAGCTCGGCCACCACGCTCGCGGACGGCGGGGAGGGGCAG 

CACGACGGCACATTCGAGCCTGCCACCGTGGCTCTTCCAGGCGGCGAGCACGCCGAGAACGC 

CGTGCAGATCCACAAGGTGGTCACGGGCACCATGGCCCTCATCTTCTCCTTCCTCATCGTGG 

TCCTGGTGCTCTACGTGTCCTGGAAGTGTTTCCCAGCCAGCCTCAGGCAGCTCAGACAGTGC 

TTTGTCACGCAGCGCAGGAAGCAAAAGCAGAAACAGACCATGCATCAGATGGCTGCCATGTC 

TGCCCAGGAATACTACGTTGATTACAAACCGAACCACATTGAGGGAGCCCTGGTGATCATCA 

ACGAGTATGGCTCGTGTACCTGCCACCAGCAGCCCGCGAGGGAATGCGAGGTGTGATTGTCC 

CAGTGGCTCTCAACCCATGCGCTACCAAATACGCCTGGGCAGCCGGGACGGGCCGGCGGGCA 

CCAGGCTGGGGTCTCCTTGTCTGTGCTCTGATATGCTCCTTGACTGAAACTTTAAGGGGATC 

TCTCCCAGAGACTTGACATTTTAGCTTTATTGTGTCTTAAAAACAAAAGCGAATTAAAACAC 

AACAAAAAACCCCACCCCACAACCTTCAGGACAGTCTATCTTAAATTTCATATGAGAACTCC 

TTCCTCCCTTTGAAGATCTGTCCATATTCAGGAATCTGAGAGTGTAAAAAAGGTGGCCATAA 

GACAGAGAGAGAATAATCGTGCTTTGTTTTATGCTACTCCTCCCACCCTGCCCATGATTAAA 

CATCATGTATGTAGAAGATCTTAAGTCCATACGCATTTCATGAAGAACCATTGGAAAGAGGA 

ATCTGCAATCTGGGAGCTTAAGAGCAAATGATGACCATAGAAAGCTATGTTCTTACTTTGTG 

TGTGTGTCTGTATGTTTCTGCGTTGTGTGTCTTTGTAGGCAAGCAAACGTTGTCTACACAAA 

CGGGAATTTAGCTCACATCATTTCATGCCCCTGTGCCTCTAGCTCTGGAGATTGGTGGGGGG 

AGGTGGGGGGAAACGGCAGGAATAAGGGAAAGTGGTAGTTTTAACTAAGGTTTTGTAACACT 

TGAAATCTTTTCTTTCTCAAATTAATTATCTTTAAGCTTCAAGATVACTTGCTCTGACCCCTC 

TAAGCAAACTACTAAGCATTTAAAAGAGAATCTAATTTTTAAAGGTGTAGCACCTTTTTTTT 

TATTCTTCCCACAGAGGGTGCTAATCTCATTATGCTGTGCTATCTGAAAAGAACTTAAGGCC 

ACAATTCACGTCTCGTCCTGGGCATTGTGATGGATTGACCCTCCATTTGCAGTACCTTCCCA 

GCTGATTAAAGTTCAGCAGTGGTATTGAGGTTTTTCGAATATTTATATAGAAAT^^GTCTT 

TTCACATGACAAATGACACTCTCACACCAGTCTTAGCCCTAGTAGTTTTTTAGGTTGGACCA 

GAGGAAGCAGGTTAAATGAGACCTGTCCTCTGCTGCACTCAGAAAAAATAGGCAGTCCCTGA 

TGCTCAGATCTTAGCCTTGATATTAATAGTTGAGACCACCTACCCACAATGCAGCCTATACT 

CCCAAGACTACAAAGTTACCATCGCAAAGGAAAGGTTATTCCAGTAAAAGGAAATAGTTTTC 

TCAACCATTTAAAAATATTCTTCTGAACTCATOVAAGTAGAAGAGCCCCCAACCTTTTCTCT 

CTGCCTTCAAGAAGGCAGACATTTGGTATGATTTAGCATCAACAACACATTTATGAGTA 
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FIGURE 195B 

GTAAGTAATCAGAGGGGCAAATGCCACTTGTTATTCCTCCCAAGTTTTCC7AGCAAGTACAC 
ACAGATCTCTGGTAGGATTAGGGGCCACTTGTGTTTCCGGCTTATTTTAGTCGACTTGTCAG 
CAAGTTTGATGCCTAGTCTATCTGACATGGCCCAGTAGAACAGGGCATTGATGGATCACATG 
AGATGGTAGAAGGAACATCATCACATACCCCTCTCACAGAGAAAATTATCA7VAGAACCAGAA 
ATTATATCTGTTTTGGAGCAAGAGTGTCATAATGTTTCAGGGTAGTCAAAATAAACATAAAT 
TATCTCCTCTAGATGAGTGGCGATGTTGGCTGATTTGGGTCTGCCATTGACAGAATGTCAAA 
TAAAAAGGAATTAGCTAGAATATGACCATTAAATGTGCTTCTGAAATATATTTTGAGATAGG 
TTTAGAATGTCA 
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FIGURE 196 

MDFLLLGLCLYWLLRRPSGWLCLLGACFQMLPAAPSGCPQLCRCEGRLLYCEALNLTEAPH 
NLSGLLGLSLRYNSLSELRAGQFTGLMQLTWLYLDHNHICSVQGDAFQKLRRVKELTLSSNQ 
ITQLPNTTFRPMPNLRSVDLSYNKLQALAPDLFHGLRKLTTLHMRANAIQFVPVRIFQDCRS 
LKFUDIGYNQLKSLARNSFAGLFKLTELHLEHNDLVKVNFAHFPRLISLHSLCIJIRNKVAIV 
VSSIJJWVWNLEKMDLSGNEIEYMEPHVFETVPHLQSLQLDSNRLTYIEPRILNSWKSLTSIT 
LAGNLWDCGRNVCALASWLSNFQGRYDGNLQCASPEYAQGEDVLDAVYAFHLCEDGAEPTSG 
HLLSAVTNRSDLGPPASSATTLADGGEGQHDGTFEPATVALPGGEHAENAVQIHKVVTGTMA 
LI FS FLI WLVLYVSWKCFPASLRQLRQCFVTQRRKQKQKQTMHQMAAMSAQEYYVDYKPNH 
lEGALVI INEYGSCTCHQQPARECEV 



WSaXXa. <WO_99830e8A2J.> 
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GTGCAAGGAGCCGAGGCGAG&IgGGCGTCCTGGGCCGGGTCCTGCTGTGGCTGCAGCTCTGC 
GCACTGACCCAGGCGGTCTCCAAACTCTGGGTCCCCAACACGGACTTCGACGTCGCAGCCAA 
CTGGAGCCAGAACCGGACCCCGTGCGCCGGCGGCGCCGTTGAGTTCCCGGCGGACAAGATGG 
TGTCAGTCCTGGTGCAAGAAGGTCACGCCGTCTCAGACATGCTCCTGCCGCTGGATGGGGAA 
CTCGTCCTGGCTTCAGGAGCCGGATTCGGCGTCTCAGACGTGGGCTCGCACCTGGACTGTGG 
CGCGGGCGAACCTGCCGTCTTCCGCGACTCTGACCGCTTCTCCTGGCATGACCCGCACCTGT 
GGCGCTCTGGGGACGAGGCACCTGGCCTCTTCTTCGTGGACGCCGAGCGCGTGCCCTGCCGC 
CACGACGACGTCTTCTTTCCGCCTAGTGCCTCCTTCCGCGTGGGGCTCGGCCCTGGCGCTAG 
CCCCGTGCGTGTCCGCAGCATCTCGGCTCTGGGCCGGACGTTCACGCGCGACGAGGACCTGG 
CTGTTTTCCTGGCGTCCCGCGCGGGCCGCCTACGCTTCCACGGGCCGGGCGCGCTGAGCGTG 
GGCCCCGAGGACTGCGCGGACCCGTCGGGCTGCGTCTGCGGCAACGCGGAGGCGCAGCCGTG 

GATCTGCGCGGCCCTGCTCCAGCCCCT 
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FIGURE 198 

MGVLGRVLLWLQLCALTQAVSKLWVPNTDFDVAANWSQimTPCAGGAVEFPADK^^ 
GHAVSDMLLPIJDGELVIiASGAGFGVSDVGSHLDCGAGEPAVFRDSDRFSWHDPHLWRSGDEA 
PGLFFVDAERVPCRHDDVFFPPSASFRVGLGPGASPVRVRSISALGRTFTRDEDLAVFLASR 
AGRLRFHGPGT^SVGPEDCADPSGCVCGNAEAQPWI CAALLQP 

) 
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ATCGCATCAATTGGGAGTACCATCTTCCTCATGGGACCAGTGAAACAGCTGAAGCGAATGTT 

TGAGCCTACTCGTTTGATTGCAACrATCATGGTGCTGTTGTGTTTTGCACTO 

CTGCCTTTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCA 

TTGACGTGGtACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGC 

rnTGTGTCTTGC ATAAT TCATGGCCAGTTTTATGAAGCTTTGGAAGGCACTATGGACAGAAG 

CTGGTGGACAGTTTTGTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCT 

TGCAGCAATGTGTTGCTTGTGATTCGAACATTTGAGGGTTACTTTTGGAAGCAACAATACAT 

TCTCGAACCTGAATGTCAGTAGCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAA 

TCTTCCTCATGTACCTGTTTCCTCTCTGGATGTTGTCCCACTGAATTCCCATGAATACAAAC 

CTATTCAGCAACAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAATU^AAAAAAA 

AAAi\AAAAAAAAAAAAAAAA?^AAA 
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MGPVKQLKRMFEPTRLIATIIWLLCFALTLCSAFWWHNKGIALIFCILQSLALT^ 
FARDAVKKCFAVCIiA 
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FIGURE 201 

TTGAGCGCAGGTGAGCTCCTGCGCGTTCCGGGGGCGTTCCTCCAGTCACCCTCCCGCCGTTA 

CCCGCGGCGCGCCCGAGGGAGTCTCCTCCAGACCCTCCCTCCCGTTGCTCCAAACTAATACG 

GACTGAACGGATCGCTGCGAGGGTGGGAGAGAAAATTAGGGGGAGAAAGGACAGAGAGAGCA 

ACTACCATCCATAGCCAGATAGATTATCTTACACTGAACTGATCAAGTACTTTGAAAATGAC 

TTCGAAATTTATCTTGGTGTCCTTCATACTTGCTGCACTGAGTCTTTCAACCACCTTTTCTC 

TCCAACTAGACCAGCAAAAGGTTCTACTAGTTTCTTTTGATGGATTCCGTTGGGATTACTTA 

TATAAAGTTCCAACGCCCCATTTTCATTATATTATGAAATATGGTGTTCACGTGAAGCAAGT 

TACTAATGTTTTTATTACAAAAACCTACCCTAACCATTATACTTTGGTAACTGGCCTCTTTG 

CAGAGAATCATGGGATTGTTGCAAATGATATGTTTGATCCTATTCGGAACAAATCTTTCTCC 

TTGGATCACATGAATATTTATGATTCCAAGTTTTGGGAAGAAGCGACACCAATATGGATCAC 

AAACCAGAGGGCAGGACATACTAGTGGTGCAGCCATGTGGCCCGGAACAGATGTAAAAATAC 

ATAAGCGCTTTCCTACTCATTACATGCCTTACAATGAGTCAGTTTCATTTGAAGATAGAGTT 

GCCAAAATTGTTGAATGGTTTACGTCAAAAGAGCCCATAAATCTTGGTCTTCTCTATTGGGA 

AGACCCTGATGACATGGGCCACCATTTGGGACCTGACAGTCCGCTCATGGGGCCTGTCATTT 

CAGATATTGACAAGAAGTTAGGATATCTCATACAAATGCTGAAAAAGGCAAAGTTGTGGAAC 

ACTCTGAACCTAATCATCACAAGTGATCATGGAATGACGCAGTGCTCTGAGGAAAGGTTAAT 

AGAACTTGACCAGTACCTGGATAAAGACCACTATACCCTGATTGATCAATCTCCAGTAGCAG 

CCATCTTGCCAAAAGAAGGTAAATTTGATGAAGTCTATGAAGCACTAACTCACGCTCATCCT 

AATCTTACTGTTTACAAAAAAGAAGACGTTCCAGAAAGGTGGCATTACAAATACAACAGTCG 

AATTCAACCAATCATAGCAGTGGCTGATGT^GGGTGGCACATTTTACAGAATAAGTCAGATG 

ACTTTCTGTTAGGCAACCACGGTTACGATAATGCGTTAGCAGATATGCATCCAATATTTTTA 

GCCCATGGTCCTGCCTTCAGAAAGAATTTCTCAAAAGAAGCCATGAACTCCACAGATTTGTA 

CCCACTACTATGCCACCTCCTCAATATCACTGCCATGCCACACAATGGATCATTCTGGAATG 

TCCAGGATCTGCTCAATTCAGCAATGCCAAGGGTGGTCCCTTATACACAGAGTACTATACTC 

CTCCCTGGTAGTGTTAAACCAGCAGAATATGACCAAGAGGGGTCATACCCTTATTTCATAGG 

GGTCTCTCTTGGCAGCATTATAGTGATTGTATTTTTTGTAATTTTCATTAAGCATTTAATTC 

ACAGTCAAATACCTGCCTTACAAGATATGCATGCTGA/^TAGCTCAACCATTATTACAAGCC 

TAATGTTACTTTGAAGTGGATTTGCATATTGAAGTGGAGATTCCATAATTATGTCAGTGTTT 

AAAGGTTTCAAATTCTGGGAAACCAGTTCCAAACATCTGCAGAAACCATTT^GCAGTTACAT 

ATTTAGGTATACACACACACACACACACACACATACACACACACGGACCAAAATACTTACAC 

CTGCAAAGGAATAAAGATGTGAGAGTATGTCTCCATTGTTCACTGTAGCATAGGGATAGATA 

AGATCCTGCTTTATTTGGACTTGGCGCAGATAATGTATATATTTAGCAACTTTGCACTATGT 

AAAGTACCTTATATATTGCACTTTAAATTTCTCTCCTGATGGGTACTTTAATTTGAAATGCA 

CTTTATGGACAGTTATGTCTTATAACTTGATTGAAAATGACAACTTTTTGCACCCATGTCAC 

AGAATACTTGTTACGCATTGTTCAAACTGAAGGAAATTTCTAATAATCCCGAATAATGAACA 

TAGAAATCTATCTCCATAAATTGAGAGAAGAAGAAGGTGATAAGTGTTGAAAATTAAATGTG 

ATAACCTTTGAACCITGAATTTTGGAGATGTATTCCCAACAGCAGAATGCAACTGTGGGC^ 

TTCTTGTCTTATTTCTTTCCAGAGAACGTGGTTTTCATTTATTTTTCCCTCAAAAGAGAGTC 

A7VATACTGACAGATTCGTTCTAAATATATTGTTTCTGTCATAAAATTATTGTGATTTCCTGA 

TGAGTCATATTACTGTGATTTTCATAATAATGAAGACACCATGAATATACTTTTCTTCTATA 

TAGTTCAGCAATGGCCTGAATAGAAGCAACCAGGCACCATCTCAGCAATGTTTTCTCTTGTT 

TGTAATTATTTGCTCCTTTGAAAATT/^TCACTATTAATTACATTAAAAATCAAATTGGAT 

AAAAAAAAAAAAAAAAAAA 
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MTSKFILVSFILAALSLSTTFSLQLDQQKVLLVSFDGFRWDYLYKVPTPHFHYIMKYGVHVK 
QVTNVFITKTYPNHYTLVTGLFAENHGIVANDMFDPIRNKSFSLDH^INIYDSKFWEEATPIW 
ITNQRAGHTSGAAMWPGTDVKIHKRFPTHYMPYNESVSFEDRVAKIVEWFTSKEPINLGLLY 
WEDPDDMGHHLGPDSP1J4GPVISDIDKKLGYLIQMLKKAKLWNTLNLIITSDHGMTQCSEER 
LIELDQYLDKDHYTLIDQSPVAAILPKEGKFDEVYEALTHAHPNLTVYKKEDVPERWHYKYN 
SRIQPIIAVADEGWHILQNKSDDFLLGNHGYDNALADMHPIFLAHGPAFRKNFSKEAMNSTD 
LYPLLCHLLNITAMPHNGSFWNVQDLLNSAMPRWPYTQSTILLPGSVKPAEYDQEGSYPYF 
IGVSLGSIIVIVFFVIFIKHLIHSQIPALQDMHAEIAQPLLQA 
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GGATTTTTGTGATCCGCGATTCGCTCCCACGGGCGGGACCTTTGTAACTGCGGGAGGCCCAG 

GACAGGCCCACCCTGCGGGGCGGGAGGCAGCCGGGGTGAGGGAGGTGAAGAAACCAAGACGC 

AGAGAGGCCAAGCCCCTTGCCTTGGGTCACACAGCCAAAGGAGGCAGAGCCAGAACTCACAA 

CCAGATCCAGAGGCAACAGGGACATGGCCACCTGGGACGAAAAGGCAGTCACCCGC^ 

AAGGTGGCTCCCGCTGAGAGGATGAGCAAGTTCTTAAGGCACTTCAC6GTCGTGGGAGACGA 

CTACCATGCCTGGAACATC/^CTACAAGAAATGGGAGAATGAAGAGGAGGAGGAGGAGGAGG 

AGCAGCCACCACCCACACCAGTCTCAGGCGAGGAAGGCAGAGCTGCAGCCCCTGACGTTGCC 

CCTGCCCCTGGCCCCGCACCCAGGGCCCCCCTTGACTTCAGGGGCATGTTGAGGAAACTGTT 

CAGCTCCCACAGGTTTCAGGTCATCATCATCTGCTTGGTGGTTCTGGATGCCCTCCTGGTGC 

TTGCTGAGCTCATCCrrGGACCTGAAGATCATCCAGCCCGACAAGAATAACTATGCTGCCATG 

GTATTCCACTACATGAGCATCACCATCTTGGTCTTTTTTATGATGGAGATCATCTTTAAATT 

ATTTGTCTTCCGCCTGAGTTCTTTCACCACAAGTTTGAGATCCTGGATGCCCGTCGTGGTGG 

TGGTCTCATTCATCCTGGACATTGTCCTCCTGTTCCAGGAGCACCAGTTTGAGGCTCTGGGC 

CTGCTGATTCTGCTCCGGCTGTGGCGGGTGGCCCGGATCATCAATGGGATTATCATCTCAGT 

TAAGACACGTTCAGAACGGCAACTCTTAAGGTTAAAACAGATGAATGTACAATTGGCCGCCA 

AGATTCAACACCTTGAGTTCAGCTGCTCTGAGAAGCCCCTGGACTGATGAGTTTGCTGTATC 

AACCTGTAAGGAGAAGCTCTCTCCGGATGGCTATGGGAATGAAAGAATCCGACTTCTACTCT 

CACACAGCCACCGTGAAAGTCCTGGAGTAAAATGTGCTGTGTACAGAAGAGAGAGAAGGAAG 

CAGGCTGGCATGTTCACTGGGCTGGTGTTACGACAGAGAACCTGACAGTCACTGGCCAGTTA 

TCACTTCAGATTACAAATCACACAGAGCATCTGCCTGTTTTCAATCACAAGAGAACAAAACC 

AAAATCTATAAAGATATTCTGAAAATATGACAGAATTTGACAAATAAAAGCATAAACGTGTA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 204 

MATWDEKAVTRRAKVAPAERMSKFLRHFTVVGDDYHAWNINYKKWENEEEEEEEEQPPPTPV 
SGEEGRAAAPDVAPAPGPAPRAPLDFRGMLRKLFSSHRFQVI I ICLWLDALLVLAELILDL 
KIIQPDKNNYAAMVFHYMSITILVFFMMEIIFKLFVFRLSSFTTSLRSWMPVWWSFILDI 
VLLFQEHQFEALGLLILLRLWRVARI INGI IISVKTRSERQLLRLKQMNVQLAAKIQHLEFS 
CSEKPLD 
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FIGURE 205 

CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACAT 
GCTCTGTCTGTGCCTGTACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTG 
AGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCCATTTTCAAGCTCAGTGTCTTCATCCCCTCC 
CAGGAATTCTCCACCTACCGCCAGTGGAAGCAGAAAATTGTACAAGCTGGAGATAAGGACCT 
TGATGGGCAGCTAGACTTTGAAGAATTTGTCCATTATCTCCAAGATCATGAGAAGAAGCTGA 
GGCTGGTGTTTAAGATTTTGGACAAAAAGAATGATGGACGCATTGACGCGCAGGAGATCATG 
CAGTCCCTGCGGGACTTGGGAGTCAAGATATCTGAACAGCAGGa^GAATU^TTCTCAAGAG 
CATGGATAAAAACGGCACGATGACCATCGACTGGAACGAGTGGAGAGACTACCACCTCCTCC 
ACCCCGTGGAAAACATCCCCGAGATCATCCTCTACTGGAAGCATTCCACGATCTTTGATGTG 
GGTGAGAATCTAACGGTCCCGGATGAGTTCACAGTGGAGGAGAGGCAGACGGGGATGTGGTG 
GAGACACCTGGTGGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGG 
ACAGGCTCAAGGTGCTCATGCAGGTCCATGCCTCCCGCAGCAACAACATGGGCATCGTTGGT 
GGCTTCACTCAGATGATTCGAGAAGGAGGGGCCAGGTCACTCTGGCGGGGCAATGGCATCAA 
CGTCCTCAAAATTGCCCCCGi\ATCAGCC^TCAAATTCATGGCCTATGAGCAGATCAAGCGCC 
TTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCA 
GGGGCCATCGCCCAGAGCAGCATCTACCCAATGGAGGTCCTGAAGACCCGGATGGCGCTGCG 
GAAGACAGGCCAGTACTCAGGAATGCTGGACTGCGCCAGGAGGATCCTGGCCAGAGAGGGGG 
TGGCCGCCTTCTACAAAGGCTATGTCCCCAACATGCTGGGCATCATCCCCTATGCCGGCATC 
GACCTTGCAGTCTACGAGACGCTCAAGAATGCCTGGCTGCAGCACTATGCAGTGAACAGCGC 
GGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGG 
CCAGCTACCCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCT 
CCGGAGGTGACCATGAGCAGCCTCTTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCT 
GTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCAGCTGTGAGCATCAGCTACGTGG 
TCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCGGTGACGGGGGGAGGGCCGCCCG 
GCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATG 
TGCCAACACTAAGCTGTCTCGAGCCAAGCTGTG/^AAACCCTAGACGCACCCGCAGGGAGGGT 
GGGGAGAGCTGGCAGGCCCAGGGCTTGTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCC 
AGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGGCTCCGGGCTCACATGTGTAA 
GGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCTTAGT 
TCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGGCGTGC 
ATCTCCCTGTGCCCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAG 
CCCACATCCCACCCCCTCGTCCAATCCCATAATCCATGATGAAAGGTGAGGTCACGTGGCCT 
CCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGCTGTGAAGGAAGAGGAAAG 
GATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATGCT 
TGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCA 
TGGTGCTCTGAGCTGGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTG 
TCCCCACTGTGGCATGAGGGCAGTGGAGCACCATGTTTGAGGGCGAAGGGCAGAGCGTTTGT 
GTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGGACTGTTGGGAAAAGGG 
TTTTGTCCAGAAGGACAAGCCGGACAAATGAGCGACTTCTGTGCTTCCAGAGGAAGACGAGG 
GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACC 
CCAGCAGGGGCGCAGCGGGACCAGCCCCACATTCCACTTGTGTCACTGCTTGGAACCTATTT 
ATTTTGTATTTATTTGAACAGAGTTATGTCCTAACTATTTTTATAGATTTGTTTAATTAATA 
GCTTGTCATTTTCAAGTTCATTTTTTATTCATATTTATGTTCATGGTTGATTGTACCTTCCC 
AAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACAT 
CTGTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTG 
GCTCCTTTCCTTTGGCAGGTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGA 
CTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAATAACCTTGAAGGTGGAATCCAGTTATTTC 
CTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAGTGAGGTGCCTCT 
CACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCC 
CAGCCTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCC 
ATTCCACCAGAATGACCTGATGAGGAAATCTTCAATAGGATGCAAAGATCAATGCAAAAATT 
GTTATATATGAACATATAACTGGAGTCGTCAAAAAGCAAATTAAGAT^GAATTGGACGTTAG 
AAGTTGTCATTTAAAGCAGCCTTCTAATAAAGTTGTTTCAAAGCTGAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKD 
LDGQLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQE IMQSLRDLGVKI SEQQAEKILK 
SMDKNGTMTIDWNEWRDYHLLHPVENIPEIILYWKHSTIFDVGENLTVPDEFTVEERQTGMW 
WRHLVAGGGAGAVSRTCTAPLDRLKVLMQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGI 
NVLKIAPESAIKFMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMAL 
RKTGQYSGMLDCARRILAREGVAAFYKGYVPNMLGIIPYAGIDLAVyETLKNAWLQHYAVNS 
ADPGVFVLLACGTMSSTCGQLASYPLALVRTRMQAQASIEGAPEVTMSSLFKHILRTEGAFG 
LYRGLAPNFMKVI PAVS i S YWYENLKITLGVQSR 
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FIGURE 207 

GG AAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCAT 

GGCTTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAG 

CTlATTGCACTCATCATTGGCnTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTC 

GCCTCAGCTGGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAA 

ACTTTCTGATATCGTGATACAATGGCTGAAGGAAGGTGTTTTAGGCTTGGTCCATGAGTTCA 

AAGAAGGCAAAGATGAGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTT 

GCTGATCAAGTGATAGTTGGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGC 

TGGCACCTACAAATGTTATATCATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATA 

AAACTGGAGCCTTCAGCATGCCGGAAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTG 

CGGTGTGAGGCTCCCCGATGGTTCCCCCAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCA 

GGGAGCCAACTTCTCGGAAGTCTCCAATACCAGCTTTGAGCTGAACTCTGAGAATGTGACCA 

TGAAGGTTGTGTCTGTGCTCTACAATGTTACGATCAACAACACATACTCCTGTATGATTGAA 

AATGACATTGCCAAAGCAACAGGGGATATCAAAGTGACAG/^TCGGAGATCAAAAGGCGGAG 

TCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGTCTCTTCTTTCTTTGCCATCAGCT 

GGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAATGTGCCTTGGCCACAAAAAAG 

CATGCAAAGTCATTGTTACAACAGGGATCTACAGAACTATTTCACCACCAGATATGACCTAG 

TTTTATATTTCTGGGAGGAAATGAATTCATATCTAGAAGTCTGGAGTGAGCAAACAAGAGCA 

AGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAATCTATCTTCAAA 

GACATATTAGAAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGTTAAGAGTGATAAG 

TAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGTCACCT 

GGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGCTG 

TAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCA 

CT^AATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGG 

GGCGGCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCT 

TGGCTTCTCTTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATTTTAGCATAA 

ACAGAGCAGTCGGGGACACCGATTTTATAAATAAACTGAGCACCTTCTTTTTAAACAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 208 



MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDI 
KLSDIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTD 
AGTYKCYIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTVVWASQVD 
QGANFSEVSNTS FELNSENVTMKWS VL YNVTINNT YS CM I END I AKATGD I KVTESE I KRR 
SHLQLLNSKASLCVSSFFAISWALLPLSPYLMLK 
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FIGURE 209 

GAATTTGTAGAAGACAGCGGCGTTGCCATGGCGGCGTCTCTGGGGCAGGTGTTGGCTCTGGT 
GCTGGTGGCCGCTCTGTGGGGTGGCACGCAGCCGCTGCTGAAGCGGGCCTCCGCCGGCCTGC 
AGCGGGTTCATGAGCCGACCTGGGCCCAGCAGTTGCTACAGGAGATGAAGACCCTCTTCTTG 
AATACTGAGTACCTGATGCCCTTTCTCCTCAACCAGTGTGGATCCCTTCTCTATTACCTCAC 
CTTGGCATCGACAGATCTGACCCTGGCTGTGCCCATCTGTAACTCTCTGGCTATCATCTTCA 
CACTGATTGTTGGGAAGGCCCTTGGAGAAGATATTGGTGGAAAACGTAAGTTAGACTACTGC 
GAGTGCGGGACGCAGCTCTGTGGATCTCGACATACCTGTGTTAGTTCCTTCCCAGAACCCAT 
CTCCCCAGAGTGGGTGAGGACACGGCCTTTTCCCATCCTGCCCTTTCCTCTGCAGCTGTTTT 
GCTTCCTTGTGGCCATCAGAGTTCCCTTCCCCTGGACAGTCTGGAGAAAGACAGAGGCTGGG 
GTTTGGGAT2SAAGACCAGACCCCATCTGAGCCCTTCCTCCAGCCCTGTACCAGCTCCTACT 
GGCATGGCTGAGCTCAGACCCTCCTGATTTCTGCCTATTATCCCAGGAGCAGTTGCTGGCAT 
GGTGCTCACCGTGATAGGAATTTCACTCTGCATCACAAGCTCAGTGAGTAAGACCCAGGGGC 
AACAGTCTACCCTTTGAGTGGGCCGAACCCACTTCCAGCTCTGCTGCCTCCAGGAAGCCCCT 
GGGCCATGAAGTGCTGGCAGTGAGCGGATGGACCTAGCACTTCCCCTCTCTGGCCTTAGCTT 
CCTCCTCTCTTATGGGGATAACAGCTACCTCATGGATCACAATAAGAGAACAAGAGTGAAAG 
AGTTTTGTAACCTTCAAGTGCTGTTCAGCTGCGGGGATTTAGCACAGGAGACTCTACGCTCA 
CCCTCAGCAACCTTTCTGCCCCAGCAGCTCTCTTCCTGCTAACATCTCAGGCTCCCAGCCCA 
GCCACCATTACTGTGGCCTGATCTGGACTATCATGGTGGCAGGTTCCATGGACTGCAGAACT 
CCAGCTGCATGGAAAGGGCCAGCTGCAGACTTTGAGCCAGAAATGCAAACGGGAGGCCTCTG 
GGACTCAGTCAGAGCGCTTTGGCTGAATGAGGGGTGGAACCGAGGGAAGAAGGTGCGTCGGA 
GTGGCAGATGCAGGAT^TGAGCTGTCTATTAGCCTTGCCTGCCCCACCCATGAGGTAGGCAG 
AAATCCTCACTGCCAGCCCCTCTTAAACAGGTAGAGAGCTGTGAGCCCCAGCCCCACCTGAC 
TCCAGCACACCTGGCGAGTAGTAGCTGTCAATAAATCTATGTAAACAGACAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA7VAAAAAAAAAAA 
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FIGURE 21 n 

MAASLGQVLALVLVAALWGGTQPLLKRASAGLQRVHEPTWAQQLLQEMKTLFLNTEYLMPFL 
LNQCGSLLYYLTLASTDLTLAVPICNSLAIIFTLIVGKALGEDIGGKRKLDYCECGTQLCGS 
RHTCVSSFPEPISPEWVRTRPFPILPFPLQLFCFLVAIRVPFPWTVWRKTEAGVWD 
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FIGURE 21 1 

CTTCTGTAGGACAGTCACCAGGCCAGATCCAGAAGCCTCTCTAGGCTCCAGCTTTCTCTGTG 
GAAGATGACAGCAATTATAGCAGGACCCTGCCAGGCTGTCGAAAAGATTCCGCAATAAAACT 
TTGCCAGTGGGAAGTACCTAGTGAAACGGCCTAAGATGCCACTTCTTCTCATGTCCCAGGCT 
TGAGGCCCTGTGGTCCCCATCCTTGGGAGAAGTCAGCTCCAGCACC ATGA AGGGCATCrTCG 
TTGCTGGTATCACTGCAGTGCTTGTTGCAGCTGTAGAATCTCTGAGCTGCGTGCAGTGTAAT 
•TCATGGGAAAAATCCTGTGTCAACAGCATTGCCTCTGAATGTCCCTCACATGCCAACACCAG 
CTGTATCAGCTCCTCAGCCAGCTCCTCTCTAGAGACACCAGTCAGATTATACCAGAATATGT 
TCTGCTCAGCGGAGAACTGCAGTGAGGAGACACACATTACAGCCTTCACTGTCCACGTGTCT 
GCTGAAGAACACTTTCATTTTGTAAGCCAGTGCTGCCAAGGAAAGGAATGCAGCAACACCAG 
CGATGCCCTGGACCCTCCCCTGT^GAACGTGTCCAGCAACGCAGAGTGCCCTGCTTGTTATG 
AATCTAATGGAACTTCCTGTCGTGGGAAGCCCTGGAAATGCTATGAAGAAGAACAGTGTGTC 
TTTCTAGTTGCAGAACTTAAGAATGACATTGAGTCTAAGAGTCTCGTGCTGAAAGGCTGTTC 
CAACGTCAGTAACGCCACCTGTCAGTTCCTGTCTGGTGAAAACAAGACTCTTGGAGGAGTCA 
TCTTTCGAAAGTTTGAGTGTGCAAATGTAAACAGCTTAACCCCCACGTCTGCACCAACCACT 
TCCCACAACGTGGGCTCCAAAGCTTCCCTCTACCTCTTGGCCCTTGCCAGCCTCCTTCTTCG 
GGGACTGCTGCCCTGAGGTCCTGGGGCTGCACTTTGCCCAGCACCCCATTTCTGCTTCTCTG 
AGGTCCAGAGCACCCCCTGCGGTGCTGACACCCTCTTTCCCTGCTCTGCCCCGTTTAACTGC 
CCAGTAAGTGGGAGTCACAGGTCTCCAGGCAATGCCGACAGCTGCCTTGTTCTTCATTATTA 
AAGCACTGGTTCATTCACTGCCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 212 

MKGILVAGITAVLVAAVESLSCVQCNSWEKSCVNSIASECPSHANTSCISSSASSSLETPVR 
LYQNMFCSAENCSEETHITAFTVHVSAEEHFHFVSQCCQGKECSNTSDALDPPLKNVSSNAE 
CPACYESNGTSCRGKPWKCYEEEQCVFLVAELKNDIESKSLVLKGCSNVSNATCQFLSGENK 
TLGGVIFRKFECANVNSLTPTSAPTTSHNVGSKASLYLLALASLLLRGLLP 
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FIGURE 213 

GGCCTCGGTTCAAACGACCCGGTGGGTCTACAGCGGAAGGGAGGGAGCGAAGGTAGGAGGCA 

GGGCTTGCCTCACTGGCCACCCTCCCAACCCCAAGAGCCCAGCCCCATGGTCCCCGCC^ 

GCGCGCTGCTGTGGGTCCTGCTGCTGAATCTGGGTCCCCGGGCGGCGGGGGCCCAAGGCCTG 

ACCCAGACTCCGACCGAAATGCAGCGGGTCAGTTTACGCTTTGGGGGCCCCATGACCCGCAG 

CTACCGGAGCACCGCCCGGACTGGTCTTCCCCGGAAGACAAGGATAATCCTAGAGGACGAGA 

ATGATGCCATGGCCGACGCCGACCGCCTGGCTGGACCAGCGGCTGCCGAGCTCTTGGCCGCC 

ACGGTGTCCACCGGCTTTAGCCGGTCGTCCGCCATTAACGAGGAGGATGGGTCTTCAGAAGA 

GGGGGTTGTGATTAATGCCGGAAAGGATAGCACCAGCAGAGAGCTTCCCAGTGCGACTCCCA 

ATACAGCGGGGAGTTCCAGCACGAGGTTTATAGCCAATAGTCAGGAGCCTGAAATCAGGCTG 

ACTTCAAGCCTGCCGCGCTCCCCCGGGAGGTCTACTGAGGACCTGCCAGGCTCGCAGGCCAC 

CCTGAGCCAGTGGTCCACACCTGGGTCTACCCCGAGCCGGTGGCCGTCACCCTCACCCACAG 

CCATGCCATCTCCTGAGGATCTGCGGCTGGTGCTGATGCCCTGGGGCCCGTGGCACTGCCAC 

TGCAAGTCGGGCACCATGAGCCGGAGCCGGTCTGGGAAGCTGCACGGCCTTTCCGGGCGCCT 

TCGAGTTGGGGCGCTGAGCCAGCTCCGCACGGAGCACAAGCCTTGCACCTATCAACAATGTC 

CCTGCAACCGACTTCGGGAAGAGTGCCCCCTGGACACAAGTCTCTGTACTGACACCAACTGT 

GCCTCTCAGAGCACCACCAGTACCAGGACCACCACTACCCCCTTCCCCACCATCCACCTCAG 

AAGCAGTCCCAGCCTGCCACCCGCCAGCCCCTGCCCAGCCCTGGCTTTTTGGAAACGGGTCA 

GGATTGGCCTGGAGGATATTTGGAATAGCCTCTCTTCAGTGTTCACAGAGATGCAACCAATA 

GACAGAAACCAGAGGTAATGGCCACTTCATCCACATGAGGAGATGTCAGTATCTCAACCTCT 

CTTGCCCTTTCAATCCTAGCACCCACTAGATATTTTTAGTACAGAAAAACAAAACTGGAAAA 

CACAA 
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FIGURE 214 
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MVPAAGALLWVLLLNLGPRAAGAQGLTQTPTEMQRVSLRFGGPMTRSYRSTARTGLPRKTRI 

ILEDENDAMADADRIAGPAAAELLAATVSTGFSRSSAINEEDGSSEEGWINAGKDSTSREL 

PSATPNTAGSSSTRFIANSQEPEIRLTSSLPRSPGRSTEDLPGSQATLSQWSTPGSTPSRWP 

SPSPTAMPSPEDIJILVIJ^PWGPWHCHCKSGTMSRSRSGKLHGLSGRI^VGALSQLRTEHK^ 

TYQQCPCNRLREECPLDTSLCTDTNCASQSTTSTRTTTTPFPTIHLRSSPSLPPASPCPALA 

FWKRVRIGLEDIWNSLSSVFTEMQPIDRNQR 
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CCCGGGTCGACCCACGCGTCCGGGGAGAAAGGATGGCCGGCCTGGCGGCGCGGTTGGTCCTG 
CTAGCTGGGGCAGCGGCGCTGGCGAGCGGCTCCCAGGGCGACCGTGAGCCGGTGTACCGCGA 
CTGCGTACTGCAGTGCGAAGAGCAGAACTGCTCTGGGGGCGCTCTGAATCACTTCCGCTCCC 
GCCAGCCAATCTACATGAGTCTAGCAGGCTGGACCTGTCGGGACGACTGTAAGTATGAGTGT 
ATGTGGGTCACCGTTGGGCTCTACCTCCAGGAAGGTCACAAAGTGCCTCAGTTCCATGGCAA 
GTGGCCCTTCTCCCGGTTCCTGTTCTTTCAAGAGCCGGCATCGGCCGTGGCCTCGTTTCTCA 
ATGGCCTGGCCAGCCTGGTGATGCTCTGCCGCTACCGCACCTTCGTGCCAGCCTCCTCCCCC 
ATGTACCACACCTGTGTGGCCTTCGCCTGGGTGTCCCTCAATGCATGGTTCTGGTCCACAGT 
CTTCCACACCAGGGACACTGACCTCACAGAGAAAATGGACTACTTCTGTGCCTCCACTGTCA 
TCCTACACTCAATCTACCTGTGCTGCGTCAGGACCGTGGGGCTGCAGCACCCAGCTGTGGTC 
AGTGCCTTCCGGGCTCTCCTGCTGCTCATGCTGACCGTGCACGTCTCCTACCTGAGCCTCAT 
CCGCTTCGACTATGGCTACAACCTGGTGGCCAACGTGGCTATTGGCCTGGTCAACGTGGTGT 
GGTGGCTGGCCTGGTGCCTGTGGAACCAGCGGCGGCTGCCTCACGTGCGCAAGTGCGTGGTG 
GTGGTCTTGCTGCTGCAGGGGCTGTCCCTGCTCGAGCTGCTTGACTTCCCACCGCTCTTCTG 
GGTCCTGGATGCCCATGCCATCTGGCACATCAGCACCATCCCTGTCCACGTCCTCTTTTTCA 
GCTTTCTGGAAGATGACAGCCTGTACCTGCTGAAGGAATCAGAGGACAAGTTCAAGCTGGAC 
TGAAGACCTTGGAGCGAGTCTGCCCCAGTGGGGATCCTGCCCCCGCCCTGCTGGCCTCCCTT 
CTCCCCTCAACCCTTGAGATGATTTTCTCTTTTCAACTTCTTGAACTTGGACATGAAGGATG 
TGGGCCCAGAATCATGTGGCCAGCCCACCCCCTGTTGGCCCTCACCAGCCTTGGAGTCTGTT 
CTAGGGAAGGCCTCCCAGCATCTGGGACTCGAGAGTGGGCAGCCCCTCTACCTCCTGGAGCT 
GAACTGGGGTGGAACTGAGTGTGTTCTTAGCTCTACCGGGAGGACAGCTGCCTGTTTCCTCC 
CCACCAGCCTCCTCCCCACATCCCCAGCTGCCTGGCTGGGTCCTGAAGCCCTCTGTCTACCT 
GGGAGACCAGGGACCACAGGCCTTAGGGATACAGGGGGTCCCCTTCTGTTACCACCCCCCAC 
CCTCCTCCAGGACACCACTAGGTGGTGCTGGATGCTTGTTCTTTGGCCAGCCAAGGTTCACG 
GCGATTCTCCCCATGGGATCTTGAGGGACCAAGCTGCTGGGATTGGGAAGGAGTTTCACCCT 
GACCGTTGCCCTAGCCAGGTTCCCAGGAGGCCTCACCATACTCCCTTTCAGGGCCAGGGCTC 
CAGCAAGCCCAGGGCAAGGATCCTGTGCTGCTGTCTGGTTGAGAGCCTGCCACCGTGTGTCG 
GGAGTGTGGGCCAGGCTGAGTGCATAGGTGACAGGGCCGTGAGCATGGGCCTGGGTGTGTGT 
GAGCTCAGGCCTAGGTGCGCAGTGTGGAGACGGGTGTTGTCGGGGAAGAGGTGTGGCTTCAA 
AGTGTGTGTGTGCAGGGGGTGGGTGTGTTAGCGTGGGTTAGGGGAACGTGTGTGCGCGTGCT 
GGTGGGCATGTGAGATGAGTGACTGCCGGTGAATGTGTCCACAGTTGAGAGGTTGGAGCAGG 
ATGAGGGAATCCTGTCACCATCAATAATCACTTGTGGAGCGCCAGCTCTGCCCAAGACGCCA 
CCTGGGCGGACAGCCAGGAGCTCTCCATGGCCAGGCTGCCTGTGTGCATGTTCCCTGTCTGG 
TGCCCCTTTGCCCGCCTCCTGCAAACCTCACAGGGTCCCCACACAACAGTGCCCTCCAGAAG 
CAGCCCCTCGGAGGCAGAGGAAGGAAAATGGGGATGGCTGGGGCTCTCTCCATCCTCCTTTT 
CTCCTTGCCTTCGCATGGCTGGCCTTCCCCTCCAAAACCTCCATTCCCCTGCTGCCAGCCCC 
TTTGCCATAGCCTGATTTTGGGGAGGAGGAAGGGGCGATTTGAGGGAGAAGGGGAGAAAGCT 
TATGGCTGGGTCTGGTTTCTTCCCTTCCCAGAGGGTCTTACTGTTCCAGGGTGGCCCCAGGG 
CAGGCAGGGGCCACACTATGCCTGTGCCCTGGTAAAGGTGACCCCTGCCATTTACCAGCAGC 
CCTGGCATGTTCCTGCCCCACAGGAATAGAATGGAGGGAGCTCCAGAAACTTTCCATCCCAA 
AGGCAGTCTCCGTGGTTGAAGCAGACTGGATTTTTGCTCTGCCCCTGACCCCTTGTCCCTCT 
TTGAGGGAGGGGAGCTATGCTAGGACTCCAACCTCAGGGACTCGGGTGGCCTGCGCTAGCTT 
CTTTTGATACTGAAAACTTTTAAGGTGGGAGGGTGGCAAGGGATGTGCTTAATAAATCAATT 
CCAAGCCTCAAAAAAAAAAAAAAAAA 
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FIGURE 216 

MAGLAARLVLI^GAAALASGSQGDREPVYRDCVLQCEEQNCSGGALNHFRSR 
TCRDDCKYEOyiWVTVGLYLQEGHKVPQFHGKWPFSRFLFFQEPASAVASFIiNGI^ 
YRTFVPASSPMYHTCVAFAWVSLNAWFWSTVFHTRDTDLTEKMDYFCASTVI LHS I YLCCVR 
TVGLQHPAWSAFRALLLIJyiLTVHVSYLSLIRFDyGYNLVANVAIGLVNVVWWLAWCLWNQ 
RLPHVRKCWWLLLQGLSLLELLDF PPLFWVLDAHAI WH I STI PVHVLFFSFLEDDSLYLL 
KESEDKFKLD 
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FIGURE 217 

GGCCGCCTGGAATTGTGGGAGTTGTGTCTGCCACTCGGCTGCCGGAGGCCGAAGGTCCGTGA 
CTATGGCTCCCCAGAGCCTGCCTTCATCTAGGATGGCTCCTCTGGGCATGOTGCTTGGGCTG 
CTGATGGCCGCCTGCTTCACCTTCTGCCTCAGTCATCAGAACCTGAAGGAGTTTGCCCTGAC 
CAACCCAGAGAAGAGCAGCACCAAAGAAACGGAGAGAAAAGAAACCAAAGCCGAGGAGGAGC 
TGGATGCCGAAGTCCTGGAGGTGTTCCACCCGACGCATGAGTGGCAGGCCCTTCAGCCAGGG 
CAGGCTGTCCCTGCAGGATCCCACGTACGGCTGAATCTTCAGACTGGGGAAAGAGAGGCAAA 
ACTCCAATATGAGGACAAGTTCCGAAATAATTTGAAAGGCAAAAGGCTGGATATCAACACCA 
ACACCTACACATCTCAGGATCTCAAGAGTGCACTGGCAAAATTCAAGGAGGGGGCAGAGATG 
GAGAGTTCAAAGGAAGACAAGGCAAGGCAGGCTGAGGTAAAGCGGCTCTTCCGCCCCATTGA 
GGAACTGAAGAAAGACTTTGATGAGCTGAATGTTGTCATTGAGACTGACATGCAGATCATGG 
TACGGCTGATCAACAAGTTCAATAGTTCCAGCTCCAGTTTGGAAGAGAAGATTGCTGCGCTC 
TTTGATCTTGAATATTATGTCCATCAGATGGACAATGCGCAGGACCTGCTTTCCTTTGGTGG 
TCTTCAAGTGGTGATCAATGGGCTGAACAGCACAGAGCCCCTCGTGAAGGAGTATGCTGCGT 
TTGTGCTGGGCGCTGCCTTTTCCAGCAACCCCAAGGTCCAGGTGGAGGCCATCGAAGGGGGA 
GCCCTGCAGAAGCTGCTGGTCATCCTGGCCACGGAGCAGCCGCTCACTGCAAAGAAGAAGGT 
CCTGTTTGCACTGTGCTCCCTGCTGCGCCACTTCCCCTATGCCCAGCGGCAGTTCCTGAAGC 
TCGGGGGGCTGCAGGTCCTGAGGACCCTGGTGCAGGAGAAGGGCACGGAGGTGCTCGCCGTG 
CGCGTGGTCACACTGCTCTACGACCTGGTCACGGAGAAGATGTTCGCCGAGGAGGAGGCTGA 
GCTGACCCAGGAGATGTCCCCAGAGAAGCTGCAGCAGTATCGCCAGGTACACCTCCTGCCAG 
GCCTGTGGGAACAGGGCTGGTGCGAGATCACGGCCCACCTCCTGGCGCTGCCCGAGCATGAT 
GCCCGTGAGAAGGTGCTGCAGACT^CTGGGCGTCCTCCTGACCACCTGCCGGGACCGCTACCG 
TCAGGACCCCCAGCTCGGCAGGACACTGGCCAGCCTGCAGGCTGAGTACCAGGTGCTGGCCA 
GCCTGGAGCTGCAGGATGGTGAGGACGAGGGCTACTTCCAGGAGCTGCTGGGCTCTGTCAAC 
AGCTTGCTGAAGGAGCTGAGATGAGGCCCCACACCAGGACTGGACTGGGATGCCGCTAGTGA 
GGCTGAGGGGTGCCAGCGTGGGTGGGCTTCTCAGGCAGGAGGACATCTTGGCAGTGCTGGCT 
TGGCCATTAAATGGAAACCTGAAGGCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAT^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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MAPQSLPSSRMAPLGMLLGLLMAACFTFCLSHQNLKEFALTNPEKSSTKETERKETKAEEEL 
DAEVLEWHPTHEWQALQPGQAVPAGSHWLNLQTGEREAKLQYEDKFRNNLKGKRLDINTN 
TYTSQDLKSALAKFKEGAEMESSKEDKARQAEVKRLFRPIEELKKDFDELNWIETDMQIMV 
RLINKFNSSSSSLEEKIAALFDLEYYVHQMDNAQDLLSFGGLQWINGLNSTEPLVKEYAAF 
VLGA7UPSSNPKVQVEAIEGGALQKLLVILATEQPLTAKKKVLFALCSLLRHFPYAQRQFLKL 
GGLQVLRTLVQEKGTEVIJVVRVVTLLYDLVTEKMFAEEEAELTOEMSPEKLQQYRQVHLLPG 

LWEQGWCEITAHLLALPEHDAREKVLQTLGVLLTTCRDRYRQDPQLGRTLASLQAEYQVLAS 
LELQDGEDEGYFQELLGSVNSLLKELR 
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FIGURE 219 

TTCGGCTTCCGTAGAGGAAGTGGCGCGGACCTTCATTTGGGGTTTCGGTTCCCCCCCTTCCC 
CTTCCCCGGGGTCTGGGGGTGACATTGCACCGCGCCCCTCGTGGGGTCGCGTTGCCACCCCA 
CGCGGACTCCCCAGCTGGCGCGCCCCTCCCATTTGCCTGTCCTGGTCAGGCCCCCACCCCCC 
TTCCCACCTGACCAGCCATGGGGGCTGCGGTGTTTTTCGGCTGCACT^ 

CCGGCCTTCGCGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGT 
CGCAGGGGCATTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGG 
TCCATGTGACCGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCT 
GTCTCTGTCCTTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGA 
TGAAGGGTTAGCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCGCCAGATGGCCT 
ATGTTTCTGGTCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCT 
GATGCACTTGGGCCAGGTGTGGTTGGGATCCATGGAGACTCACCCTATTACTTCCTGACTTC 
AGCCTTTCTGACAGCAGCCATTATCCTGCTCCATACCTTTTGGGGAGTTGTGTTCTTTGATG 
CCTGTGAGAGGAGACGGTACTGGGCTTTGGGCCTGGTGGTTGGGAGTCACCTACTGACATCG 
GGACTGACATTCCTGAACCCCTGGTATGAGGCCAGCCTGCTGCCCATCTATGCAGTCACTGT 
TTCCATGGGGCTCTGGGCCTTCATCACAGCTGGAGGGTCCCTCCGAAGTATTCAGCGCAGCC 
TCTTGTGTAAGGACTGACTACCTGGACTGATCGCCTGACAGATCCCACCTGCCTGTCCACTG 
CCCATGACTGAGCCCAGCCCCAGCCCGGGTCCATTGCCCACATTCTCTGTCTCCTTCTCGTC 
GGTCTACCCCACTACCTCCAGGGTTTTGCTTTGTCCTTTTGTGACCGTTAGTCTCTAAGCTT 
TACCAGGAGCAGCCTGGGTTCAGCCAGTCAGTGACTGGTGGGTTTGAATCTGCACTTATCCC 
CACCACCTGGGGACCCCCTTGTTGTGTCCAGGACTCCCCCTGTGTCAGTGCTCTGCTCTCAC 
CCTGCCCAAGACTCACCTCCCTTCCCCTCTGCAGGCCGACGGCAGGAGGACAGTCGGGTGAT 
GGTGTATTCTGCCCTGCGCATCCCACCCGAGGACTGAGGGAACCTAGGGGGGACCCCTGGGC 
CTGGGGTGCCCTCCTGATGTCCTCGCCCTGTATTTCTCCATCTCCAGTTCTGGACAGTGCAG 
GTTGCCAAGAAAAGGGACCTAGTTTAGCCATTGCCCTGGAGATGAAATTAATGGAGGCTCAA 
GGATAGATGAGCTCTGAGTTTCTCAGTACTCCCTCAAGACTGGACATCTTGGTCTTTTTCTC 
AGGCCTGAGGGGGAACCATTTTTGGTGTGATAAATACCCTAAACTGCCTTTTTTTCTTTTTT 
GAGGTGGGGGGAGGGAGGAGGTATATTGGAACTCTTCTAACCTCCTTGGGCTATATTTTCTC 
TCCTCGAGTTGCTCCTCATGGCTGGGCTCATTTCGGTCCCTTTCTCCTTGGTCCCAGACCTT 
GGGGGAAAGGAAGGAAGTGCATGTTTGGGAACTGGCATTACTGGAACTAATGGTTTTAACCT 
CCTTAACCACCAGCATCCCTCCTCTCCCCAAGGTGAAGTGGAGGGTGCTGTGGTGAGCTGGC 
CACTCCAGAGCTGCAGTGCCACTGGAGGAGTCAGACTACCATGACATCGTAGGGAAGGAGGG 
GAGATTTTTTTGTAGTTTTTAATTGGGGTGTGGGAGGGGCGGGGAGGTTTTCTATAAACTGT 
ATCATTTTCTGCTGAGGGTGGAGTGTCCCATCCTTTTAATCAAGGTGATTGTGATTTTGACT 
AATAAAAAAGAATTTGTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 220 

MGAAVFFGCTFVAFGPAFALFLITVAGDPLRVIILVAGAFFWLVSLLLASVVWFILVHVTDR 
SDARLQYGLLIFGAAVSVLLQEVFRFAYYKLLKKADEGLASLSEDGRSPISIRQMAYVSGLS 
FGI ISGVFSVINILi\DALGPGWGIHGDSPYYFLTSAFLTAAI ILLHTFWGWFFDACERRR 
YWAIX3LWGSHLLTSGLTFLNPWYEASLLPIYAVTVSMGLWAFITAGGSLRSIQRSLLCKD 
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FIGURE 221 

AAGCTGGTTTAAGGAAGCAGAGGAGGGTTAGATTCGTTGAGTGAGGACGGAAGATCAACCCA 
TTTCCATTCCGCCAGATGGCCTATGTTTCTGGTCTCTCCCTTCGGNATCATCAGTGGTGTNT 
TNTCTGTTATCAATATTTTGGCTGATGCANTTGGGCCAGGTGTGGTTGGGATCCATGGAGAC 
TCACCCTATTANTTCCTGANTTCAGCCTTTNTGACAGCAGCCATTATCCTGCTC 
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FIGURE 222 

GACCGACCGTTCAGATGCCCGGTTCCAGTACGGCTTCCTGATTTTTGGTGCTGCTGTNTCTG 
TCCTTCTACAGGAGGTGTTCCGCTTTGCCTANTACAAGCTGCTTAAGAAGGCAGATGAGGGG 
TTAGCATNGCTGAGTGAGGACGGAAGATCACCCATTTCCATCCGCCAGATGGCCTATGTTTN 
TGGTOTTTCCTTCGGTATO^TCAGTGGTGTTTTNTCTGTTATCAATATTTTGGNTGATGCAN 
TTGGGCCAGGTGTGGTTGGGATCCATGGAGANTCACCCTATTAATTCCTGAATTCAGCCTTT 
NTGACAGCAGCCATTATCCTGNTCCATACCTTTTGGGGAGTTGTGTTTTTTGATGCCTGTGA 
GAGGAG 
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FIGURE 223 

NGTTGGAGT^GTGGCGCGGACNTTCATTTGGGGTTTCGGTTTCCCCCCTTTCCCTTTCCCCG 
GGGTCTGGGGTGACATTGCACGGGCCCCTCGTGGGGTCGCGTTGCCACCCCACGCGGACTCC 
CCAGNTGGNGCGCCCTTCCCATTTGCCTGTCCTGGTCAGGCCCCCACCCCCCTTCCCACNTG 
ACCAGCCATGGGGGCTGCGGTGTTTTTCGGCTGCACTTTCGTCGCGTTCGGCCCGGCCTTCG 
CGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGTCGCAGGGGCA 
TTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGGTCCATGTGAC 
CGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCTGTCTCTGTCC 
TTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGATGAGGGGTTA 
GCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCGCCAGATGGCCTATGTTTCTGG 
TCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCTGATGCACTTG 
GGCCAGGTGTGGTTGGGATCCATGGAGACTCACCC 
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FIGURE 224 

GTAAAAGAAAGTGGCCGGACCTTCATTGGGGTTTCGGTTCCCCCCTTTCCCNTTCCCCGGGG 
TCTGGGGGTGACATTGCACCGCGCCCNTCGTGGGGTCGCGTTGCCACCCCACGCGGACTCCC 
CAGNTGGCGCGCCCCTCCCATTTGCCTGTCCTGGTCAGGCCCCCACCCCCCTTCCCACCTGA 
CCAGCCATGGGGGCTGCGGTGTTTTTCGGGCTGCACTTTCGTCGCGTTCGGGCCCGGCCTTC 
GCGCTTTTCTTGATCACTGTGGCTGGGGACCCGCTTCGCGTTATCATCCTGGTCGCAGGGGC 
ATTTTTCTGGCTGGTCTCCCTGCTCCTGGCCTCTGTGGTCTGGTTCATCTTGGTCCATGTGA 
CCGACCGGTCAGATGCCCGGCTCCAGTACGGCCTCCTGATTTTTGGTGCTGCTGTCTCTGTC 
CTTCTACAGGAGGTGTTCCGCTTTGCCTACTACAAGCTGCTTAAGAAGGCAGATGAGGGGTT 
AGCATCGCTGAGTGAGGACGGAAGATCACCCATCTCCATCCGCCAGATGGCCTATGTTTCTG 
GTCTCTCCTTCGGTATCATCAGTGGTGTCTTCTCTGTTATCAATATTTTGGCTGATGCACTT 
GGGCCAGGTGTGGTTGGGATCCATGGAGAC 
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FIGURE 225 

GCCCCAGGGAGCAGTGGGTGGTTATAACTCAGGCCCGGTGCCCAGAGCCCAGGAGGAGGCAG 
TGGCCAGGAAGGCACAGGCCTGAGAAGTCTGCGGCTGAGCTGGGAGCAAATCCCCCACCCCC 
TACCTGGGGGACAGGGCAAGTGAGACCTGGTGAGGGTGGCTCAGCAGGCAGGGAAGGAGAGG 
TGTCTGTGCGTCCTGCACCCACATCTTTCTCTGTCCCCTCCTTGCCCTGTCTGGAGGCTGCT 
AGACTCCTATCTTCTGAATTCTATAGTGCCTGGGTCTCAGCGCAGTGCCGATGGTGGCCCGT 
CCTTGTGGTTCCTCTCTACCTGGGGAAATAAGGTGCAGCGGCCATGGCTACAGCAAGACCCC 
CCTGGATGTGGGTGCTCTGTGCTCTGATCACAGCCTTGCTTCTGGGGGTCACAGAGCATGTT 
CTCGCCAACAATGATGTTTCCTGTGACCACCCCTCTAACACCGTGCCCTCTGGGAGCAACCA 
GGACCTGGGAGCTGGGGCCGGGGAAGACGCCCGGTCGGATGACAGCAGCAGCCGCATCATCA 
ATGGATCCGACTGCGATATGCACACCCAGCCGTGGCAGGCCGCGCTGTTGCTAAGGCCCAAC 
CAGCTCTACTGCGGGGCGGTGTTGGTGCATCCACAGTGGCTGCTCACGGCCGCCCACTGCAG 
GAAGAAAGTTTTCAGAGTCCGTCTCGGCCACTACTCCCTGTCACCAGTTTATGAATCTGGGC 
AGCAGATGTTCCAGGGGGTCAAATCCATCCCCCACCCTGGCTACTCCCACCCTGGCCACTCT 
AACGACCTCATGCTCATCAAACTGAACAGAAGAATTCGTCCCACTAAAGATGTCAGACCCAT 
CAACGTCTCCTCTCATTGTCCCTCTGCTGGGACAAAGTGCTTGGTGTCTGGCTGGGGGACAA 
CCAAGAGCCCCCAAGTGCACTTCCCTAAGGTCCTCCAGTGCTTGAATATCAGCGTGCTAAGT 
CAGAAAAGGTGCGAGGATGCTTACCCGAGACAGATAGATGACACCATGTTCTGCGCCGGTGA 
CAAAGCAGGTAGAGACTCCTGCCAGGGTGATTCTGGGGGGCCTGTGGTCTGCAATGGCTCCC 
TGCAGGGACTCGTGTCCTGGGGAGATTACCCTTGTGCCCGGCCCAACAGACCGGGTGTCTAC 
ACGAACCTCTGCAAGTTCACCAAGTGGATCCAGGAAACCATCCAGGCCAACTCCTGAGTCAT 
CCCAGGACTCAGCACACCGGCATCCCCACCTGCTGCAGGGACAGCCCTGACACTCCTTTCAG 
ACCCTCATTCCTTCCCAGAGATGTTGAGAATGTTCATCTCTCCAGCCCCTGACCCCATGTCT 
CCTGGACTCAGGGTCTGCTTCCCCCACATTGGGCTGACCGTGTCTCTCTAGTTGAACCCTGG 
GAACAATTTCCAAAACTGTCCAGGGCGGGGGTTGCGTCTCAATCTCCCTGGGGCACTTTCAT 
CCTCAAGCTCAGGGCCCATCCCTTCTCTGCAGCTCTGACCCAAATTTAGTCCCAGAAATAAA 
CTGAGAAGTGGAAAAAAA/y^ 
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MATARPPWMWVLCALITALLIX3VTEHVLANNDVSCDHPSNTVPSGSNQDLGAGAGEDARSDD 
SSSRIINGSDCDMHTQPWQAALLLRPNQLYCGAVLVHPQWLLTAAHCRKKVFRVRLGHYSLS 
PVYESGQQMFQGVKSIPHPGYSHPGHSNDLMLIKLNRRIRPTKDVRPINVSSHCPSAGTKCL 
VSGWGTTKSPQVHFPKVLQCLNISVLSQKRCEDAYPRQIDDTMFCAGDKAGRDSCQGDSGGP 
WCNGSLQGLVSWGDYPCARPNRPGVYTNLCKFTKWIQETIQANS 
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FIGURE 227 



GCTTCGCACGCCTGGAGAGCGCCCAGGCCTCGGTGCTGCAGGCGCTGA^GASArSr^ 

cagccacggctggtgggcgaccaggagcaggagctgctggacSc?gS^^ 

CCGGCTGCTGGCCCGAGCCTCAGAGCTGCAGACGGAGTGCATGG^GCTGCGG^S^ 

gcacgctgggccagggcctcagcgccctgcagagtgagcaggg^gcctS?^ 

tctgagagccagggccacatggctcacctggtgaactccgtcagSaStcc^^ 

gcagagggaccgggggctgggccggccccgcaacaaggccgaStcagagS^ 

ggggaacccggccccggggctgtgccactggctcccggcccoSga^^^ 

ctaagcggacagcaggacgatggcgtctactctgtctttcccac^^^ 

ccaggtgtactgtgacatgcgcacggacggcggcggctggacg^^^ 

acggctccgtgaacttcttccggggctgggacgcgtaccgSggctSggSSct^ 

ggggagcactggctagggctcaagaggatccacgccctgacca(S^g™goc^^ 

gcacgtggacctggaggactttgagaatggcacggcctatgcccgS-acS^^^ 

tgggcttgttctccgtggaccctgaggaagacgggtacccgctcaccgtgg^^^ 

ggcactgcaggcgactccctcctgaagcacagcggcatgaggttcac?I??S^^ 

cagcgaccattcagagaacaactgtgccgccttctaccgcggtgcctggtgS?^^ 

gccacacgtccaacctcaatgggcagtacctgcgcggtgcgcaScctcctatg?SS??S 

gtggagtggtcctcctggaccggctggcagtactcactcaagtt?tSgaS?g^^ 

gccggtccgggaggaccgciagactggtgcaccttgtccttggccctgctggt^^ 

cccatccccgaccccacctcactctttcgtgaatgttctccacccSc^^^ 

ccactctccagtagggaggggccgggccatccctgacacga^sct^^ 

cacacatcgccttctcgccgtccccaccccctccatttgg^gSSSS?^^^ 

tgctgatgggggctggcaaacttgacgaccccaactcctgcctgcco^^^ 

tgctgtttgccgtcccctggccaggatggtggagtctgccccaggcaccctctg?^^ 

ggccaaatacccggcattatggggacagagagcagggggcagSSacccctSSg??^ 

ctagcagatcgtggggaatgtcaggtctctctgaggtcaggtctgagg?cSa?^^ 

ccctcccaatgccaacccccaccccgtttccctggtgcccagagaacccacctct^ 

gggcctcagcctggctgtgggctgggtggccccatcctaccaggccctS^^^ 

SGCtgctgcctttggggacccacgctccaag 

cctgtgccccggcaggcctggggtctgcagtcctcttacctgctgtgcccacctgSct^ 

tctcaaatgaggcccaacccatcccccacccagctcccggccgtcctcctacotg^^^ 

cggggctgccatccctvtttctcctgcctctggaaggtgggtg^^^^ 

GGACTGCGCTAATGGGAAGCTCTTGGTTTTCTGGGCTGGGGCCmSSGGG^^^ 

gcttgtacaacccccaccaccaatttcccagggactc<SSgg^^^^ 

gccttgggggtgatgaccccttccctgaggtggctgtctccatSggaggSSc^^ 

attgaccgtggccacctggacccaggccaggcccggcccScSgtgg??^^S 

^Ji^E^TCACCGGGCAAATGGGGTCGGGGGGACTGGGGCACCAGAC^^ 



GGG 



GCCTAGCAGCCTCTCCTCGGGCAGGAGGGGAGGTGGCTTCCTC^^^ 

GGTGCCTAGGGGGTGTGGGGTTCCGTTCTCCCTTCCCCTCCCAC^GT^GC^^ 

AACAATAAATTTGACTTGGCACCACTGGGGGTTGGTGGGAGAGGCCCTG^^ 
TGTCCCAGTGCCACCAGGTCATCCACATGCGCAG ^'^^'^^^'-^-GTGTGACCTGGCTCTC 
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MVNDRWKTMCXSAAQLEDRPRDKPQRPSCGYVLCTVLIAIAVLLAVAVTGAVLFLNHAHAPGT 

APPPWSTGAASANSALVTVERADSSHLSILIDPRCPDLTDSFARLESAQASVLQALTEHQA 

QPRLVGDQEQELLDTLADQLPRLLARASELQTECMGLRKGHGTLGQGLSALQSEQGRLIQLL 

SESQGHMAHLVNSVSDILDALQRDRGLGRPRNKADLQRAPARGTRPRGCATGSRPRDCLDVL 

LSGQQDDGVYSVFPTHYPAGFQVYCDMRTDGGGWTVFQRREDGSVNFFRGWDAYRDGFGRLT 

GEHWLGLKRIHALTTQAAYELHVDLEDFENGTAYARYGSFGVGLFSVDPEEDGYPLTVADYS 

GTAGDSLLKHSGMRFTTKDRDSDHSENNCAAFYRGAWWYRNCHTSNLNGQYLRGAHASYADG 
VEWSSWTGWQYSLKFSEMKIRPVREDR 
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FIGURE 229 

GCAGTCAGAGAOTTCCCCTGCCCCTCGCTGGGAAAGAACATTAGGAATGCCTTTTAGTGCCT 

TGCTTCCTGAACTAGCTCACAGTAGCCCGGCGGCCCAGGGCAATCCGACCACATTTCACTCT 

CACCGCTGTAGGAATCCAGMTGCT^GGCCAAGTACAGCAGCACGAGGGACATGCTGGAT 

ATGGGGACACCACCATGAGCCTGCATTCTCAAGCCTCTGCCACAACTCGGCATCCAGAGCCC 

CGGCGCACAGAGCACAGGGCTCCCTCTTCAACGTGGCGACCAGTGGCCCTGACCCTGCTGAC 

TTTGTGCTTGGTGCTGCTGATAGGGCTGGCAGCCCTGGGGCTTTTGTTTTTTCAGTACTACC 

AGCTCTCCAATACTGGTCAAGACACCATTTCTCAAATGGAAGAAAGATTAGGAAATACGTCC 

CAAGAGTTGCAATCTCTTCAAGTCCAGAATATAAAGCTTGCAGGAAGTCTGCAGCATGTGGC 

TGAAAAACTCTGTCGTGAGCTGTATAACAAAGCTGGAGCACACAGGTGCAGCCCTTGTACAG 

AACAATGGAAATGGCATGGAGACAATTGCTACCAGTTCTATAAAGACAGCAAAAGTTGGGAG 

GACTGTAAATATTTCTGCCTTAGTGAAAACTCTACCATGCTGAAGATAAACAAACAAGAAGA 

CCTGGAATTTGCCGCGTCTCAGAGCTACTCTGAGTTTTTCTACTCTTATTGGACAGGGCTTT 

TGCGCCCTGACAGTGGCAAGGCCTGGCTGTGGATGGATGGAACCCCTTTCACTTCTGAACTG 

TTCCATATTATAATAGATGTCACCAGCCCAAGAAGCAGAGACTGTGTGGCCATCCTCAATGG 

GATGATCTTCTCAAAGGACTGCAAAGAATTGAAGCGTTGTGTCTGTGAGAGAAGGGCAGGAA 

TGGTGAAGCCAGAGAGCCTCCATGTCCCCCCTGAAACATTAGGCGAAGGTGACTGATTCGCC 

CTCTGCAACTACAAATAGCAGAGTGAGCCAGGCGGTGCCAAAGCAAGGGCTAGTTGAGACAT 

TGGGAAATGGAACATAATCAGGAAAGACTATCTCTCTGACTAGTACAAAATGGGTTCTCGTG 

TTTCCTGTTCAGGATCACCAGCATTTCTGAGCTTGGGTTTATGCACGTATTTAACAGTCACA 

AGAAGTCTTATTTACATGCCACCAACCAACCTCAGAAACCCATAATGTCATCTGCCTTCTTG 

GCTTAGAGATAACTTTTAGCTCTCTTTCTTCTCAATGTCTAATATCACCTCCCTGTTTTCAT 

GTCTTCCTTACACTTGGTGGAATAAGAAACTTTTTGAAGTAGAGGAAATACATTGAGGTAAC 

ATCCTTTTCTCTGACAGTCAAGTAGTCCATCAGAAATTGGCAGTCACTTCCCAGATTGTACC 

AGCAAATACACAAGGAATTCTTTTTGTTTGTTTCAGTTCATACTAGTCCCTTCCCAATCCAT 

CAGTAAAGACCCCATCTGCCTTGTCCATGCCGTTTCCCAACAGGGATGTCACTTGATATGAG 

AATCTCAAATCTCAATGCCTTATAAGCATTCCTTCCTGTGTCCATTAAGACTCTGATAATTG 

TCTCCCCTCCATAGGAATTTCTCCCAGGAAAGAAATATATCCCCATCTCCGTTTCATATCAG 

AACTACCGTCCCCGATATTCCCTTCAGAGAGATTAAAGACCAGATU^AAAGTGAGCCTCTTCA 

TCTGCACCTGTAATAGTTTCAGTTCCTATTTTCTTCCATTGACCCATATTTATACCTTTCAG 

GTACTGAAGATTTAATAATAATAAATGTAAATACTGTGAAAAA 
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FIGURE 230 



MQAKYSSTRDMIJ)DDGDTTMSLHSQASATTRHPEPRRTEHRAPSSTWRPVALTLLTLCLVLL 
IGLAAJ^LLFFQYYQLSNTGQDTISQMEERLGOTSQELQSLQVQNIKLAGSLQHVAEKLCRE 
LYNKAGAHRCSPCTEQWKWHGDNCYQFYKDSKSWEDCKYFCLSENSTMLKINKQEDLEFAAS 
QSYSEFFYSYWTGLLRPDSGKAWLWMDGTPFTSELFHIIIDVTSPRSRDCVAILNGMIFSKD 
CKELKRCVCERRAGMVKPESIiHVPPETLGEGD 
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FIGURE 231 

AATTTTCACCGCTGTAGGAATCCAGATGCAGGCCAAGTACAGCAGCACGAGGGACT^TGOT 

ATGATGATGGGACACCACCATGAGCCTGCATTNTCAAGCTTTTGCCACAATTCGGCATCCAG 

AGCCCCGGCGCACAGAGCACAGGGNTCCTTTTTCAACGTGGCGACCAGTGGCCCTGACCCTG 

CTGACTTTGTGCTTGGTGCTGCTGATAGGGCTGGCAGCCCTGGGGCTTTTGTTTTTTCAGTA 

CTACCAGCTCTCCT^TACTGGTCT^GACACCATTTCTCAAATGGAAGA/^GATTAGGAAATA 

CGTCCCAAGAGTTGCAATTTNTTCAAGTCCAGAATATAAAGCTTGCAGGAAGTNTGCAGCAT 

GTGGCTGAAAAACTCTGTCGTGAGCTGTATAACAAAGCTGGAGGAACTTTGAAGGAGGGCAA 

AGTNTCCTCATNTACTATACACACACCACTTCCC 
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FIGURE 

GCCGAGCGCAAGAACCCTGCGCAGCCCAGAGCAGCTGCTGGAGGGGAATCGAGGCGCGGCTC 

CGGGGATTCGGCTCGGGCCGCTGGCTCTGCTCTGCGGGGAGGGAGCGGGCCCGCCCGCGGGG 

CCCGAGCCCTCCGGATCCGCCCCCTCCCCGGTCCCGCCCCCTCGGAGACTCCTCTGGCTGCT 

CTGGGGGTTCGCCGGGGCCGGGGACCCGCGGTCCGGGCGCCATGCGGGCATCGCTGCTGCTG 

TCGGTGCTGCGGCCCGCAGGGCCCGTGGCCGTGGGCATCTCCCTGGGCTTCACCCTGAGCCT 

GCTCAGCGTCACCTGGGTGGAGGAGCCGTGCGGCCCAGGCCCGCCCCAACCTGGAGACTCTG 

AGCTGCCGCCGCGCGGCAACACCAACGCGGCGCGCCGGCCCAACTCGGTGCAGCCCGGAGCG 

GAGCGCGAGAAGCCCGGGGCCGGCGAAGGCGCCGGGGAGAATTGGGAGCCGCGCGTCTTGCC 

CTACCACCCTGCACAGCCCGGCCAGGCCGCCAAAAAGGCCGTCAGGACCCGCTACATCAGCA 

CGGAGCTGGGCATCAGGCAGAGGCTGCTGGTGGCGGTGCTGACCTCTCAGACCACGCTGCCC 

ACGCTGGGCGTGGCCGTGAACCGCACGCTGGGGCACCGGCTGGAGCGTGTGGTGTTCCTGAC 

GGGCGCACGGGGCCGCCGGGCCCCACCTGGCATGGCAGTGGTGACGCTGGGCGAGGAGCGAC 

CCATTGGACACCTGCACCTGGCGCTGCGCCACCTGCTGGAGCAGCACGGCGACGACTTTGAC 

TGGTTCTTCCTGGTGCCTGACACCACCTACACCGAGGCGCACGGCCTGGCACGCCTAACTGG 

CCACCTCAGCCTGGCCTCCGCCGCCCACCTGTACCTGGGCCGGCCCCAGGACTTCATCGGCG 

GAGAGCCCACCCCCGGCCGCTACTGCCACGGAGGCTTTGGGGTGCTGCTGTCGCGCATGCTG 

CTGCAACAACTGCGCCCCCACCTGGAAGGCTGCCGCAACGACATCGTCAGTGCGCGCCCTGA 

CGAGTGGCTGGGTCGCTGCATTCTCGATGCCACCGGGGTGGGCTGCACTGGTGACCACGAGG 

GGGTGCACTATAGCCATCTGGAGCTGAGCCCTGGGGAGCCAGTGCAGGAGGGGGACCCTCAT 

TTCCGAAGTGCCCTGACAGCCCACCCTGTGCGTGACCCTGTGCACATGTACCAGCTGCACAA 

AGCTTTCGCCCGAGCTGAACTGGAACGCACGTACCAGGAGATCCAGGAGTTACAGTGGGAGA 

TCCAGAATACCAGCCATCTGGCCGTTGATGGGGACCGGGCAGCTGCTTGGCCCGTGGGTATT 

CCAGCACCATCCCGCCCGGCCTCCCGCTTTGAGGTGCTGCGCTGGGACTACTTCACGGAGCA 

GCACGCTTTCTCCTGCGCCGATGGCTCACCCCGCTGCCCACTGCGTGGGGCTGACCGGGCTG 

ATGTGGCCGATGTTCTGGGGACAGCTCTAGAGGAGCTGAACCGCCGCTACCACCCGGCCTTG 

CGGCTCCAGAAGCAGCAGCTGGTGAATGGCTACCGACGCTTTGATCCGGCCCGGGGTATGGA 

ATACACGCTGGACTTGCAGCTGGAGGCACTGACCCCCCAGGGAGGCCGCCGGCCCCTCACTC 

GCCGAGTGCAGCTGCTCCGGCCGCTGAGCCGCGTGGAGATCTTGCCTGTGCCCTATGTCACT 

GAGGCCTCACGTCTCACTGTGCTGCTGCCTCTAGCTGCGGCTGAGCGTGACCTGGCCCCTGG 

CTTCTTGGAGGCCTTTGCCACTGCAGCACTGGAGCCTGGTGATGCTGCGGCAGCCCTGACCC 

TGCTGCTACTGTATGAGCCGCGCCAGGCCCAGCGCGTGGCCCATGCAGATGTCTTCGCACCT 

GTCAAGGCCCACGTGGCAGAGCTGGAGCGGCGTTTCCCCGGTGCCCGGGTGCCATGGCTCAG 

TGTGCAGACAGCCGCACCCTCACCACTGCGCCTCATGGATCTACTCTCCAAGAAGCACCCGC 

TGGACACACTGTTCCTGCTGGCCGGGCCAGACACGGTGCTCACGCCTGACTTCCTGAACCGC 

TGCCGCATGCATGCCATCTCCGGCTGGCAGGCCTTCTTTCCCATGCATTTCCAAGCCTTCCA 

CCCAGGTGTGGCCCCACCACAAGGGCCTGGGCCCCCAGAGCTGGGCCGTGACACTGGCCGCT 

TTGATCGCCAGGCAGCCAGCGAGGCCTGCTTCTACAACTCCGACTACGTGGCAGCCCGTGGG 

CGCCTGGCGGCAGCCTCAGAACAAGAAGAGGAGCTGCTGGAGAGCCTGGATGTGTACGAGCT 

GTTCCTCCACTTCTCCAGTCTGCATGTGCTGCGGGCGGTGGAGCCGGCGCTGCTGCAGCGCT 

ACCGGGCCCAGACGTGCAGCGCGAGGCTCAGTGAGGACCTGTACCACCGCTGCCTCCAGAGC 

GTGCTTGAGGGCCTCGGCTCCCGAACCCAGCTGGCCATGCTACTCTTTGAACAGGAGCAGGG 

CAACAGCACCTGACCCCACCCTGTCCCCGTGGGCCGTGGCATGGCCACACCCCACCCCACTT 

CTCCCCCAAAACCAGAGCCACCTGCCAGCCTCGCTGGGCAGGGCTGGCCGTAGCCAGACCCC 

AAGCTGGCCCACTGGTCCCCTCTCTGGCTCTGTGGGTCCCTGGGCTCTGGACAAGCACTGGG 

GGACGTGCCCCCAGAGCCACCCACTTCTCATCCCAAACCCAGTTTCCCTGCCCCCTGACGCT 

GCTGATTCGGGCTGTGGCCTCCACGTATTTATGCAGTACAGTCTGCCTGACGCCAGCCCTGC 

CTCTGGGCCCTGGGGGCTGGGCTGTAGAAGAGTTGTTGGGGAAGGAGGGAGCTGAGGAGGGG 

GCATCTCCCAACTTCTCCCTTTTGGACCCTGCCGAAGCTCCCTGCCTTTAATAAACTGGCCA 
AGTGTGGAAAAA 
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FIGURE 233 

MRASLLLSVLRPAGPVAVGISLGFTLSLLSVTWVEEPCGPGPPQPGDSELPPRGNTNAARRP 
NSVQPGAEREKPGAGEGAGENWEPRVLPYHPAQPGQAAKKAVRTRYISTELGIRQRLLVAVL 
TSQTTLPTLGVAVNRTLGHRLERWFLTGARGRRAPPGMAWTLGEERPIGHLHLALRHLLE 
QHGDDFDWFFLVPDTTYTEAHGLARLTGHLSLASAAHLYLGRPQDFIGGEPTPGRYCHGGFG 
VLLSRMLLQQLRPHLEGCRNDIVSARPDEWLGRCILDATGVGCTGDHEGVHYSHLELSPGEP 
VQEGDPHFRS ALTAHPVRDPVHMYQLHKAFARAELERTYQE I QELQWE I QNTSHLAVDGDRA 
AAWPVGIPAPSRPASRFEVLRWDYFTEQHAFSCADGSPRCPLRGADRADVADVLGTALEELN 
RRYHPALRLQKQQLVNGYRRFDPARGMEYTLDLQLEALTPQGGRRPLTRRVQLLRPLSRVEI 
LPVPYVTEASRLTVLLPLAAAERDLAPGFLEAFATAALEPGDAAAALTLLLLYEPRQAQRVA 
HADVFAPVKAHVAELERRFPGARVPWLSVQTAAPSPLRLMDLLSKKHPLDTLFLLAGPDTVL 
TPDFLNRCRMHAISGWQAFFPMHFQAFHPGVAPPQGPGPPELGRDTGRFDRQAASEACFYNS 
DYVAARGRLAAASEQEEELLESLDVYELFLHFSSLHVLRAVEPALLQRYRAQTCSARLSEDL 
YHRCLQSVLEGLGSRTQLAMLLFEQEQGNST 
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FIGURE 234 

GCTCTGGCCGGCCCCGGCGATTGGTCACCGCCCGCTAGGGGACAGCCCTGGCCTCCTCTGAT 
TGGCAAGCGCTGGCCACCTCCCCACACCCCTTGCGAACGCTCCCCTAGTGGAGAAAAGGAGT 
AGCTATTAGCCAATTCGGCAGGGCCCGCTTTTTAGAAGCTTGATTTCCTTTGAAGATGAAAG 
ACTAGCGGAAGCTCTGCCTCTTTCCCCAGTGGGCGAGGGAACTCGGGGCGATTGGCTGGGAA 
CTGTATCCACCCAAATGTCACCGATTTCTTCCTATGCAGGAAATGAGCAGACCCATCAATAA 
GAAATTTCTCAGCCTGGCCGAAAATGGTTGGCCCCACGAAGCCACGACAACTGGAGGCAAAG 
AGGGTTGCTCAACGCCCCGCCTCATTGGAAAACCAAATCAGATCTGGGACCTATATAGCGTG 
GCGGAGGCGGGGCGATGATTGTCGCGCTCGCACCCACTGCAGCTGCGCACAGTCGCATTTCT 
TTCCCCGCCCCTGAGACCCTGCAGCACCATCTGTCATGGCGGCTGGGCTGTTTGGTTTGAGC 
GCTCGCCGTCTTTTGGCGGCAGCGGCGACGCGAGGGCTCCCGGCCGCCCGCGTCCGCTGGGA 
ATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCGCTGTGGCGGGAAAGCGGCCCCCAGAAC 
CGACCACACCGTGGCAAGAGGACCCAGAACCCGAGGACGAAAACTTGTATGAGAAGAACCCA 
GACTCCCATGGTTATGACAAGGACCCCGTTTTGGACGTCTGGAACATGCGACTTGTCTTCTT 
CTTTGGCGTCTCCATCATCCTGGTCCTTGGCAGCACCTTTGTGGCCTATCTGCCTGACTACA 
GGATGAAAGAGTGGTCCCGCCGCGAAGCTGAGAGGCTTGTGAAATACCGAGAGGCCAATGGC 
CTTCCCATCATGG7ATCCAACTGCTTCGACCCCAGCAAGATCCAGCTGCCAGAGGATGAGTG 
ACCAGTTGCTAAGTGGGGCTCAAGAAGCACCGCCTTCCCCACCCCCTGCCTGCCATTCTGAC 
CTCTTCTCAGAGCACCTAATTAAAGGGGCTGAAAGTCTGAA 
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FIGURE 235 

MAAGLFGLSARRLLAAAATRGLPAARVRWESSFSRTWAPSAVAGKRPPEPTTPWQEDPEPE 
DENLYEKNPDSHGYDKDP VLDVWNMRLVFFFGVS 1 1 LVLGSTFVAYLPDYRMKEWSRREAER 
LVKYREANGLPIMESNCFDPSKIQLPEDE 
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GG CGGCTGGGCTGTTTGGTTTGAGCGCTCGCCGTCTTTTGGCGGCAGCGGCGACGCGAGGGC 
TCCCGGCCGCCCGCGTCCGCTGGGAATCTAGCTTCTCCAGGACTGTGGTCGCCCCGTCCGCT 
GTGGCGGGA7VAGCGGCCCCCAGAACCGACCACACCGTGGCAAGAGGACCCAGAACCCGAGGA 
CGAAAACTTGTATGAGAAGAACCCAGACTCCCATGGTTATGACAAGGACCCCGTTTTGGACG 
TCTGGAACATGCGACTTGTCTTCTTCTTTGGCGTCTCCATCATCCTGGTCCTTGGCAGCACC 
TTTGTGGCCTATCTGCCTGACTACAGGATGAAAGAGTGGTCCCGCCGCGAAGCTGAGAGGCT 
TGTGAAATACCGAGAGGCCAATGGCCTTCCCATCATGGAATCCAACTGCTTCGACCCCAGCA 
AGATCCAG 
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FIGURE 237 



GCGGCGGCTATGCCGCTTGCTCTGCTCGTCCTGTTGCTCCTGGGGCCCGGCGGCTGGTGCCT 

TGCAGAACCCCCACGCGACAGCCTGCGGGAGGAACTTGTCATCACCCCGCTGCCTTCCGGGG 

ACGTAGCCGCCACATTCCAGTTCCGCACGCGCTGGGATTCGGAGCTTCAGCGGGAAGGAGTG 

TCCCATTACAGGCTCTTTCCCT^GCCCTGGGGCAGCTGATCTCCAAGTATTCTC^ 

GCTGCACCTGTCATTCACACAAGGCTTTTGGAGGACCCGATACTGGGGGCCACCCTTCCTGC 

AGGCCCCATCAGGTGCAGAGCTGTGGGTCTGGTTCCAAGACACTGTCACTGATGTGGATAAA 

TCTTGGAAGGAGCTCAGTAATGTCCTCTCAGGGATCTTCTGCGCCTCTCTCAA 

CTCCACO^CACAGTCACTCCCACTGCCTCCTTCAAACCCCTGGGTCTGGCCAATGACACT 

ACCACTACTTTCTGCGCTATGCTGTGCTGCCGCGGGAGGTGGTCTGCACCGAAAACCTCACC 

CCCTGGAAGAAGCTCTTGCCCTGTAGTTCCAAGGCAGGCCTCTCTGTGCTGCTGAAGGCAGA 

TCGCTTGTTCCACACCAGCTACCACTCCCAGGCAGTGCATATCCGCCCTGTTTGCAGAAATG 

CACGCTGTACTAGCATCTCCTGGGAGCTGAGGCAGACCCTGTCAGTTGTATTTGATGCCTTC 

ATCACGGGGCAGGG7VAAGAAAGACTGGTCCCTCTTCCGGATGTTCTCCCGAACCCTCACGGA 

GCCCTGCCCCCTGGCTTCAGAGAGCCGAGTCTATGTGGACATCACCACCTACAACCAGGACA 

ACGAGACATTAGAGGTGCACCCACCCCCGACCACTACATATCAGGACGTCATCCTAGGCACT 

CGGAAGACCTATGCCATCTATGACTTGCTTGACACCGCCATGATCAACAACTCTCGAAACCT 

CAACATCCAGCTCAAGTGGAAGAGACCCCCAGAGAATGAGGCCCCCCCAGTGCCCTTCCTGC 

ATGCCCAGCGGTACGTGAGTGGCTATGGGCTGCAGAAGGGGGAGCTGAGCACACTGCTGTAC 

AACACCCACCCATACCGGGCCTTCCCGGTGCTGCTGCTGGACACCGTACCCTGGTATCTGCG 

GCTGTATGTGCACACCCTCACCATCACCTCCAAGGGCAAGGAGAACAAACCAAGTTACATCC 

ACTACCAGCCTGCCCAGGACCGGCTGCAACCCCACCTCCTGGAGATGCTGATTCAGCTGCCG 

GCCAACTCAGTCACCAAGGTTTCCATCCAGTTTGAGCGGGCGCTGCTGAAGTGGACCGAGTA 

CACGCCAGATCCTAACCATGGCTTCTATGTCAGCCCATCTGTCCTCAGCGCCCTTGTGCCCA 

GCATGGTAGCAGCCAAGCCAGTGGACTGGGAAGAGAGTCCCCTCTTCAACAGCCTGTTCCCA 

GTCTCTGATGGCTCTAACTACTTTGTGCGGCTCTACACGGAGCCGCTGCTGGTGAACCTGCC 

GACACCGGACTTCAGCATGCCCTACAACGTGATCTGCCTCACGTGCACTGTGGTGGCCGTGT 

GCTACGGCTCCTTCTACAATCTCCTCACCCGAACCTTCCACATCGAGGAGCCCCGCACAGGT 

GGCCTGGCCAAGCGGCTGGCCAACCTTATCCGGCGCGCCCGAGGTGTCCCCCCACTCTGATT 

CTTGCCCTTTCCAGCAGCTGCAGCTGCCGTTTCTCTCTGGGGAGGGGAGCCCAAGGGCTGTT 

TCTGCCACTTGCTCTCCTCAGAGTTGGCTTTTGAACCAAAGTGCCCTGGACCAGGTCAGGGC 

CTACAGCTGTGTTGTCCAGTACAGGAGCCACGAGCCAAATGTGGCATTTGAATTTGAATTAA 

CTTAGAAATTCATTTCCTCACCTGTAGTGGCCACCTCTATATTGAGGTGCTCAATAAGCAAA 

AGTGGTCGGTGGCTGCTGTATTGGACAGCACAGAAAAAGATTTCCATCACCACAGAAAGGTC 

GGCTGGCAGCACTGGCCAAGGTGATGGGGTGTGCTACACAGTGTATGTCACTGTGTAGTGGA 

TGGAGTTTACTGTTTGTGGAATAAAAACGGCTGTTTCCGTGGAAAAAAAAAAAA 
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MPLALLVLLLLGPGGWCLAEPPRDSLREELVITPLPSGDVAATFQFRTRWDSELQREGVSHY 

RLFPKALGQLISKYSLRELHLSFTQGFVmTRYWGPPFLQAPSGAELWVWFQDTVTDVDKSWK 

ELSNVLSGI FCASLNF IDSTOTWPTASFKPLQLAlTOTDHyFLRYAVLPREWCTE^ 

KLLPCSSKAGLSVLLKADRLFHTSYHSQAVHIRPVCRNARCTSISWELRQTLSWFDAFITG 

QGKKDWSLFRMFSRTLTEPCPLASESRVYVDITTYNQDNETLEVHPPPTTTYQDVILGTRKT 

YAIYDLLDTAMINNSRNLNIQLKWKRPPENEAPPVPFLHAQRYVSGYGLQKGELSTLLYNTH 

FY RAFP VLLLDTVP WYLRLYVHTLT I TS KGKENKPS Y I HYQPAQDRLQ PHLLEML I QLPANS 

VTKVSIQFERALLKWTEYTPDPNHGFYVSPSVLSALVPSMVAAKPVDWEESPLFNSLFPVSD 

GSNYFVRLYTEPLLVNLPTPDFSMPYNVICLTCTWAVCYGSFYNLLTRTFHIEEPRTGGLA 

KRLANLIRRARGVPPL 
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FIGURE 239 

CAACATGGGGTCCAGCAGCTTCTTGGTCCTCATGGTGTCTCTCGTTCTTGTGACCCTGGTGG 
CTGTGGAAGGAGTTAAAGAGGGTATAGAGAAAGCAGGGGTTTGCCCAGCTGACAACGTACGC 
TGCTTCAAGTCCGATCCTCCCCAGTGTCACACAGACCAGGACTGTCTGGGGGAAAGGAAGTG 
TTGTTACCTGCACTGTGGCTTCAAGTGTGTGATTCCTGTGAAGGAACTGGAAGAAGGAGGAA 
ACAAGGATGAAGATGTGTCT^GGCCATACCCTGAGCCAGGATGGGAGGCCAAGTGTCCAGGC 
TCCTCCTCTACCAGGTGTCCTCAGAAATGATGCTGGGTCCTTTCTACCTCTGGGGGTCACTC 
TCACTTGGCACCTGCCCCTGAGGGTCCTGAGACTTGGAATATGGAAGAAGCAATACCCAACC 
CCACCAAAGAAAACCTGAGCTTGAAGTCCTTTTCCCCAAAAAGAGGGAAGAGTCACAAAAAG 
TCCAGACCCCAGGGACGGTACTTTCCCTCTCTACCTGGTGCTCCTCCCTAATGCTCATGAAT 
GGACCCCTCATGAATGAAACCAGTGCCCTTATAAGAGACCCCAAAGAGCTGCCTTGCCCTTC 
TGCAATGTGTGATCACAGCTAGAAGGCACTGTCAGAGAAGAGAAACTGGTCCTCACCAGATG 
CTGAATCTGCTGGTGCCTTGATCTTGGACTTCCCAGCCTCTAGAACTGTAAGAAATAAATAT 
TTGCTGTTTATAATCCAA 
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FIGURE 240 



MGSSSFLVLMVSLVLVTLVAVEGVKEGIEKAGVCPADNVRCFKSDPPQCHTDQDCLGERKCC 
YLHCGFKCVIPVKELEEGGNKDEDVSRPYPEPGWEAKCPGSSSTRCPQK 
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A/^CTCAGCACTTGCCGGAGTGGCTCATTGTTAAGACAAAGGGTGTGCACTTCCTGGCCAGG 

AAACCTGAGCGGTGAGACTCCCAGCTGCCTACATCAAGGCCCCAGGACATGCAGAACCTTCC 

TCTAGAACCCGACCCACCACCATGAGGTCCTGCCTGTGGAGATGCAGGCACCTGAGCCAAGG 

CGTCCAGTGGTCCTTGCTTCTGGCTGTCCTGGTCTTCTTTCTCTTCGCCTTGCCCTCTTTTA 

TTAAGGAGCCTCAAACAAAGCCTTCCAGGCATCAACGCACAGAGAACATTAAAGAAAGGTCT 

CTACAGTCCCTGGCAAAGCCTAAGTCCCAGGCACCCACAAGGGCGAGGAGGACAACCATCTA 

TGCAGAGCCAGCGCCAGAGAACAATGCCCTCAACACACAAACCCAGCCCTyVGGCCCACAC 

CCGGAGACAGAGGAAAGGAGGCCAACCAGGCACCGCCGGAGGAGCAGGACAAGGTGCCCCAC 

ACAGCACAGAGGGCAGCATGGAAGAGCCCAGAAAAAGAGAAAACCATGGTGAACACACTGTC 

ACCCAGAGGGCAAGATGCAGGGATGGCCTCTGGCAGGACAGAGGCACAATCATGGAAGAGCC 

AGGACACAAAGACGACCCAAGGAAATGGGGGCCAGACCAGGAAGCTGACGGCCTCCAGGACG 

GTGTCAGAGAAGCACCAGGGCAAAGCGGCAACCACAGCCAAGACGCTCATTCCCAA/^GTCA 

GCACAGAATGCTGGCTCCCACAGGAGCAGTGTCAACAAGGACGAGACAGAAAGGAGTGACCA 

CAGCAGTCATCCCACCTAAGGAGAAGAAACCTCAGGCCACCCCACCCCCTGCCCCTTTCCAG 

AGCCCCACGACGCAGAGAAACCAAAGACTGAAGGCCGCCAACTTCAAATCTGAGCCTCGGTG 

GGATTTTGAGGAAAAATACAGCTTCGAAATAGGAGGCCTTCAGACGACTTGCCCTGACTCTG 

TGAAGATCAAAGCCTCCAAGTCGCTGTGGCTCCAGAAACTCTTTCTGCCCAACCTCACTCTC 

TTCCTGGACTCCAGACACTTCAACCAGAGTGAGTGGGACCGCCTGGAACACTTTGCACCACC 

CTTTGGCTTCATGGAGCTCAACTACTCCTTGGTGCAGAAGGTCGTGACACGCTTCCCTCCAG 

TGCCCCAGCAGCAGCTGCTCCTGGCCAGCCTCCCCGCTGGGAGCCTCCGGTGCATCACCTGT 

GCCGTGGTGGGCAACGGGGGCATCCTGAACAACTCCCACATGGGCCAGGAGATAGACAGTCA 

CGACTACGTGTTCCGATTGAGCGGAGCTCTCATTAAAGGCTACGAACAGGATGTGGGGACTC 

GGACATCCTTCTACGGCTTTACCGCCTTCTCCCTGACCCAGTCACTCCTTATATTGGGCAAT 

CGGGGTTTCAAGAACGTGCCTCTTGGGAAGGACGTCCGCTACTTGCACTTCCTGGAAGGCAC 

CCGGGACTATGAGTGGCTGGAAGCACTGCTTATGAATCAGACGGTGATGTCAAAAAACCTTT 

TCTGGTTCAGGCACAGACCCCAGGAAGCTTTTCGGGAAGCCCTGCACATGGACAGGTACCTG 

TTGCTGCACCCAGACTTTCTCCGATACATGAAGAACAGGTTTCTGAGGTCTAAGACCCTGGA 

TGGTGCCCACTGGAGGATATACCGCCCCACCACTGGGGCCCTCCTGCTGCTCACTGCCCTTC 

AGCTCTGTGACCAGGTGAGTGCTTATGGCTTCATCACTGAGGGCCATGAGCGCTTTTCTGAT 

CACTACTATGATACATCATGGAAGCGGCTGATCTTTTACATAT^CCATGACTTCAAGCTGGA 

GAGAGAAGTCTGGAAGCGGCTACACGATGAAGGGATAATCCGGCTGTACCAGCGTCCTGGTC 

CCGGAACTGCCAAAGCCAAGAACTGACCGGGGCCAGGGCTGCCATGGTCTCCTTGCCTGCTC 

CAAGGCACAGGATACAGTGGGAATCTTGAGACTCTTTGGCCATTTCCCATGGCTCAGACTAA 

GCTCCAAGCCCTTCAGGAGTTCCAAGGGAACACTTGAACCATGGACAAGACTCTCTCAAGAT 

GG CAAATGGCTAATTGAGGTTCTGAAGTTCTTCAGTACATTGCTGTAGGTCCTGAGGCCAGG 

GATTTTTAATTAAATGGGGTGATGGGTGGCCAATACCACAATTCCTGCTGAAAAACACTCTT 

CCAGTCCAAAAGCTTCTTGATACAGAAAAAAGAGCCTGGATTTACAGAAACATATAGATCTG 

GTTTGAATTCCAGATCGAGTTTACAGTTGTGAAATCTTGAAGGTATTACTTAACTTCACTAC 

AGATTGTCTAGAAGACCTTTCTAGGAGTTATCTGATTCTAGAAGGGTCTATACTTGTCCTTG 

TCTTTAAGCTATTTGACAACTCTACGTGTTGTAGAAAACTGATAATAATACAAATGATTGTT 

GTCCATGGAAAGGCAAATAAATTTTCTACAGTGAAAAAAAAAAAAAAA 
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FIGURE 242 

MRSCLWRCRHLSQGVQWSLLLAVLVFFLFALPSFIKEPQTKPSRHQRTENIKERSLQSLAKP 
KSQAPTRARRTTIYAEPAPENNALNTQTQPKAHTTGDRGKEANQAPPEEQDKVPHTAQRAAW 
KSPEKEKTMVNTLSPRGQDAGMASGRTEAQSWKSQDTKTTQGNGGQTRKLTASRTVSEKHQG 
KAATTAKTLI PKSQHRMLAPTGAVSTRTRQKGVTTAVI PPKEKKPQATPPPAPFQSPTTQRN 
QRLKAANFKSEPRWDFEEKYSFEIGGLQTTCPDSVKIKASKSLWLQKLFLPNLTLFLDSRHF 
NQSEWDRLEHFAPPFGFMELNYSLVQKWTRFPPVPQQQLLLASLPAGSLRCITCAWGNGG 
ILNNSHMGQEIDSHDYVFRLSGALIKGYEQDVGTRTSFYGFTAFSLTQSLLILGNRGFKNVP 
LGKDVRYLHFLEGTRDYEWLEALLMNQTVMSKNLFWFRHRPQEAFREALHMDRYLLLHPDFL 
RYMKNRFLRSKTLDGAHWRIYRPTTGALLLLTALQLCDQVSAYGFITEGHERFSDHYYDTSW 
KRLIFYINHDFKLEREVWKRLHDEGI IRLYQRPGPGTAKAKN 
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CGATGCGCGGACCCGGGCACCCCCTCCTCCTGGGGCTGCTGCTGGTGCTGGGGCCTTCGCCG 

GAGCAGCGAGTGGAT^TTGTTCCTCGAGATCTGAGGATGAAGGACAAGTTTCTAAAACACCT 

TACAGGCCCTCITTATTTTAGTCCAAAGTGCTVGCAAACACTTCCATAGACTTTATCAC^ 

CCAGAGACTGCACCATTCCTGCATACTATAAAAGATGCGCCAGGCTTCTTACCCGGCTGGCT 

GTCAGTCCAGTGTGCATGGAGGATAAGTGAGCAGACCGTACAGGAGCAGCACACCAGGAGCC 

ATGAGAAGTGCCTTGGAAACCAACAGGGAAACAGAACTATCTTTATACACATCCCCTCATGG 

ACAAGAGATTTATTTTTGCAGACAGACTCTTCCATAAGTCCTTTGAGTTTTGTATGTTGTTG 

ACAGTTTGCAGATATATATTCGATAAATCAGTGTACTTGACAGTGTTATCTGTCACTTATTT 
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FIGURE 244 

MRGPGHPLLLGLLLVLGPSPEQRVEIVPRDLRMKDKFLKHLTGPLYFSPKCSKHFHRLYHNT 
RDCTIPAYYKRCARLLTRliAVSPVCMEDK 
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FIGURE 245 

GGGCTGGGCCCCGCCGCAGCTCCAGCTGGCCGGCTTGGTCCTGCGGTCCCTTCTCTGGGAGG 
CCCGACCCCGGCCGCGCCCAGCCCCCACCATGCCACCCGCGGGGCTCCGCCGGGCCGCGCCG 
CTCACCGCAATCGCTCTGTTGGTGCTGGGGGCTCCCCTGGTGCTGGCCGGCGAGGACTGCCT 
GTGGTACCTGGACCGGAATGGCTCCTGGCATCCGGGGTTTAACTGCGAGTTCTTCACCTTCT 
GCTGCGGGACCTGCTACCATCGGTACTGCTGCAGGGACCTGACCTTGCTTATCACCGAGAGG 
CAGCAGAAGCACTGCCTGGCCTTCAGCCCCAAGACCATAGCAGGCATCGCCTCAGCTGTGAT 
CCTCTTTGTTGCTGTGGTTGCCACCACCATCTGCTGCTTCCTCTGTTCCTGTTGCTACCTGT 
ACCGCCGGCGCCAGCAGCTCCAGAGCCCATTTGAAGGCCAGGAGATTCCAATGACAGGCATC 
CCAGTGCAGCCAGTATACCCATACCCCCAGGACCCCAAAGCTGGCCCTGCACCCCCACAGCC 
TGGCTTCATGTACCCACCTAGTGGTCCTGCTCCCCAATATCCACTCTACCCAGCTGGGCCCC 
CAGTCTACAACCCTGCAGCTCCTCCTCCCTATATGCCACCACAGCCCTCTTACCCGGGAGCC 
TGAGGAACCAGCCATGTCTCTGCTGCCCCTTCAGTGATGCCAACCTTGGGAGATGCCCTCAT 
CCTGTACCTGCATCTGGTCCTGGGGGTGGCAGGAGTCCTCCAGCCACCAGGCCCCAGACCAA 
GCCAAGCCCTGGGCCCTACTGGGGACAGAGCCCCAGGGAAGTGGAACAGGAGCTGAACTAGA 
ACTATGAGGGGTTGGGGGGAGGGCTTGGAATTATGGGCTATTTTTACTGGGGGCAAGGGAGG 
GAGATGACAGCCTGGGTCACAGTGCCTGTTTTCAAATAGTCCCTCTGCTCCCAAGATCCCAG 
CCAGGAAGGCTGGGGCCCTACTGTTTGTCCCCTCTGGGCTGGGGTGGGGGGAGGGAGGAGGT 
TCCGTCAGCAGCTGGCAGTAGCCCTCCTCTCTGGCTGCCCCACTGGCCACATCTCTGGCCTG 
CTAGATTAAAGCTGTAAAGACAAAA 
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FIGURE 246 

MPPAGLRRAAPLTAIALLVLGAPLVLAGEDCLWYLDRNGSWHPGFNCEFFTFCCGTCYHRYC 
CRDLTLLITERQQKHCLAFSPKTIAGIASAVILFVAWATTICCFLCSCCYLYRRRQQLQSP 
FEGQEIPMTGIPVQPVYPYPQDPKAGPAPPQPGFMYPPSGPAPQYPLYPAGPPVYNPAAPPP 
YMPPQPSYPGA 
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FIGURE 247 

GGGGGAGCTAGGCCGGCGGCAGTGGTGGTGGCGGCGGCGCAAGGGTGAGGGCGGCCCCAGAA 
CCCCAGGTAGGTAGAGCAAGAAGATGGTGTTTCTGCCCCTCAAATGGTCCCTTGCAACCATG 
TCATTTCTACTTTCCTCACTGTTGGCTCTCTTAACTGTGTCCACTCCTTCATGGTGTCAGAG 
CACTGAAGCATCTCCAAAACGTAGTGATGGGACACCATTTCCTTGGAATAAAATACGACTTC 
CTGAGTACGTCATCCCAGTTCATTATGATCTCTTGATCCATGCAAACCTTACCACGCTGACC 
TTCTGGGGAACCACGAAAGTAGAAATCACAGCCAGTCAGCCCACCAGCACCATCATCCTGCA 
TAGTCACCACCTGCAGATATCTAGGGCCACCCTCAGGAAGGGAGCTGGAGAGAGGCTATCGG 
AAGAACCCCTGCAGGTCCTGGAACACCCCCCTCAGGAGCAAATTGCACTGCTGGCTCCCGAG 
CCCCTCCTTGTCGGGCTCCCGTACACAGTTGTCATTCACTATGCTGGCAATCTTTCGGAGAC 
TTTCCACGGATTTTACAAAAGCACCTACAGAACCAAGGAAGGGGAACTGAGGATACTAGCAT 
CAACACAATTTGAACCCACTGCAGCTAGAATGGCCTTTCCCTGCTTTGATGAACCTGCCTTC 
AAAGCAAGTTTCTCAATCAAAATTAGAAGAGAGCCAAGGCACCTAGCCATCTCCAATATGCC 
ATTGGTGAAATCTGTGACTGTTGCTGAAGGACTCATAGAAGACCATTTTGATGTCACTGTGA 
AGATGAGCACCTATCTGGTGGCCTTCATCATTTCAGATTTTGAGTCTGTCAGCAAGATAACC 
AAGAGTGGAGTCAAGGTTTCTGTTTATGCTGTGCCAGACAAGATAAATCAAGCAGATTATGC 
ACTGGATGCTGCGGTGACTCTTCTAGAATTTTATGAGGATTATTTCAGCATACCGTATCCCC 
TACCCAAACAAGATCTTGCTGCTATTCCCGACTTTCAGTCTGGTGCTATGGAAAACTGGGGA 
CTGACAACATATAGAGAATCTGCTCTGTTGTTTGATGCAGAAAAGTCTTCTGCATCAAGTAA 
GCTTGGCATCACAGTGACTGTGGCCCATGAACTGGCCCACCAGTGGTTTGGGAACCTGGTCA 
CTATGGAATGGTGGAATGATCTTTGGCTAAATGAAGGATTTGCCAAATTTATGGAGTTTGTG 
TCTGTCAGTGTGACCCATCCTGAACTGAAAGTTGGAGATTATTTCTTTGGCAAATGTTTTGA 
CGCAATGGAGGTAGATGCTTTAAATTCCTCACACCCTGTGTCTACACCTGTGGAAAATCCTG 
CTCAGATCCGGGAGATGTTTGATGATGTTTCTTATGATAAGGGAGCTTGTATTCTGAATATG 
CTAAGGGAGTATCTTAGCGCTGACGCATTTAAAAGTGGTATTGTACAGTATCTCCAGAAGCA 
TAGCTATAAAAATACAAAAAACGAGGACCTGTGGGATAGTATGGCAAGTATTTGCCCTACAG 
ATGGTGTAAAAGGGATGGATGGCTTTTGCTCTAGAAGTCAACATTCATCTTCATCCTCACAT 
TGGCATCAGGAAGGGGTGGATGTGAAAACCATGATGAACACTTGGACACTGCAGAGGGGTTT 
TCCCCTAATAACCATCACAGTGAGGGGGAGGAATGTACACATGAAGCAAGAGCACTACATGA 
AGGGCTCTGACGGCGCCCCGGACACTGGGTACCTGTGGCATGTTCCATTGACATTCATCACC 
AGCA/VATCCAACATGGTCCATCGATTTTTGCTAAAAACT^AAAACAGATGTGCTCATCCTCCC 
AGAAGAGGTGGAATGGATCAAATTTAATGTGGGCATGAATGGCTATTACATTGTGCATTACG 
AGGATGATGGATGGGACTCTTTGACTGGCCTTTTAAAAGGAACACACACAGCAGTCAGCAGT 
AATGATCGGGCAAGTCTCATTAACAATGCATTTCAGCTCGTCAGCATTGGGAAGCTGTCCAT 
TGAAAAGGCCTTGGATTTATCCCTGTACTTGAAACATGAAACTGAAATTATGCCCGTGTTTC 
AAGGTTTGAATGAGCTGATTCCTATGTATAAGTTAATGGAGAAAAGAGATATGAATGAAGTG 
GT^CTCAATTCAAGGCCTTCCTCATCAGGCTGCTAAGGGACCTCATTGATAAGCAGACATG 
GACAGACGAGGGCTCAGTCTCAGAGCAAATGCTGCGGAGTGAACTACTACTCCTCGCCTGTG 
TGCACAACTATCAGCCGTGCGTACAGAGGGCAGAAGGCTATTTCAGAAAGTGGAAGGAATCC 
AATGGAAACTTGAGCCTGCCTGTCGACGTGACCTTGGCAGTGTTTGCTGTGGGGGCCCAGAG 
CACAGAAGGCTGGGATTTTCTTTATAGTAAATATCAGTTTTCTTTGTCCAGTACTGAGAAAA 
GCCAAATTGAATTTGCCCTCTGCAGAACCCAAAATAAGGAAAAGCTTCAATGGCTACTAGAT 
GAAAGCTTTAAGGGAGATAAAATAAAAACTCAGGAGTTTCCACAAATTCTTACACTCATTGG 
CAGGAACCCAGTAGGATACCCACTGGCCTGGCAATTTCTGAGGAAAAACTGGAACAAACTTG 
TACAAAAGTTTGAACTTGGCTCATCTTCCATAGCCCACATGGTAATGGGTACAACAAATCAA 
TTCTCCACAAGAACACGGCTTGAAGAGGTAAAAGGATTCTTCAGCTCTTTGAAAGAAAATGG 
TTCTCAGCTCCGTTGTGTCCAACAGACAATTGAAACCATTGAAGAAAACATCGGTTGGATGG 
ATAAGAATTTTGATAAAATCAGAGTGTGGCTGCAAAGTGAAAAGCTTGAACGTATGTAAAAA 
TTCCTCCCTTGCCCGGTTCCTGTTATCTCTAATCACCAACATTTTGTTGAGTGTATTTTCAA 
ACTAGAGATGGCTGTTTTGGCTCCAACTGGAGATACTTTTTTCCCTTCAACTCATTTTTTGA 
CTATCCCTGTGAAAAGAATAGCTGTTAGTTTTTCATGAATGGGCTTTTTCATGAATGGGCTA 
TCGCTACCATGTGTTTTGTTCATCACAGGTGTTGCCCTGCAACGTAAACCCAAGTGTTGGGT 
TCCCTGCCACAGAAGAATAAAGTACCTTATTCTTCTCAAAAAAT^T^AAAAAAAAAAAAAAAAAAA 
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FIGURE 248 

^^FLPLKWSIiATMSFLLSSLIiALLTVSTPSWCQSTEASPKRSDGTPFPWNKIRLPEyVIPV^ 
YDLLIHANLTTLTFWGTTKVEITASQPTSTI I LHSHHLQI SRATLRKGAGERLSEEPLQVLE 
HPPQEQIALLAPEPLLVGLPYTWIHYAGNLSETFHGFYKSTYRTKEGELRILASTQFEPTA 
ARMAFPCFDEPAFKASFS IKIRREPRHLAI SNMPLVKSVTVAEGLIEDHFDVTVKMSTYLVA 
FIISDFESVSKITKSGVKVSVYAVPDKINQADYALDAAVTLLEFYEDYFSIPYPLPKQDLAA 
I PDFQSGAMENWGLTTYRES ALLFDAEKSSASSKLG ITVTVAHELAHQWFGNLVTMEl^ 
WLNEGFAKFMEFVSVSVTHPELKVGDYFFGKCFDAMEVDALNSSHPVSTPVENPAQIREMFD 
DVSYDKGACILNMLREYLSADAFKSGIVQYLQKHSYKNTKNEDLWDSMASICPTDGVKGMDG 
FCSRSQHSSSSSHWHQEGVDWTMMNTWTLQRGFPLITITVRGRirraM 

TG YLWHVPLTF I TS KSNMVHRFLLKTKTDVL I LPEE VEW I KFNVGMNG YY I VHYEDDGWDSL 

TGLLKGTHTAVSSNDRASLINNAFQLVS IGKLS I EKALDLSLYLKHETEIMPVFQGLNELI P 

MYKLMEKRDMNEVETQFKAFLIRLLRDLIDKQTWTDEGSVSEQMLRSELLLLAC^ 

QRAEGYFRKWKESNGNLSLPVDVTLAVFAVGAQSTEGWDFLYSKYQFSLSSTEKSQIEFALC 

RTQNKEKLQWLLDESFKGDKIKTQEFPQILTLIGRNPVGYPLAWQFLRKNWNKLVQKFELGS 

SSIT^VMGTTNQFSTRTRLEEVKGFFSSLKENGSQLRCVQQTIETIEENIGWMDKNFDKIR 

VWLQSEKLERM 
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CAGCCACAGACGGGTCATGAGCGCGGTATTACTGCTGGCCCTCCTGGGGTTCATCCTCCCAC 
TGCCAGGAGTGCAGGCGCTGCTCTGCCAGTTTGGGACAGTTCAGCATGTGTGGAAGGTGTCC 
GACCTACCCCGGCAATGGACCCCTAAGAACACCAGCTGCGACAGCGGCTTGGGGTGCCAGGA 
CACGTTGATGCTCATTGAGAGCGGACCCCAAGTGAGCCTGGTGCTCTCCAAGGGCTGCACGG 
AGGCCAAGGACCAGGAGCCCCGCGTCACTGAGCACCGGATGGGCCCCGGCCTCTCCCTGATC 
TCCTACACCTTCGTGTGCCGCCAGGAGGACTTCTGCAACAACCTCGTTAACTCCCTCCCGCT 
TTGGGCCCCACAGCCCCCAGCAGACCCAGGATCCTTGAGGTGCCCAGTCTGCTTGTCTATGG 
AAGGCTGTCTGGAGGGGACAACAGAAGAGATCTGCCCCAAGGGGACCACACACTGTTATGAT 
GGCCTCCTCAGGCTCAGGGGAGGAGGCATCTTCTCCAATCTGAGAGTCCAGGGATGCATGCC 
CCAGCCAGGTTGCAACCTGCTCAATGGGACACAGGAAATTGGGCCCGTGGGTATGACTGAGA 
ACTGCAATAGGAAAGATTTTCTGACCTGTCATCGGGGGACCACCATTATGACACACGGAAAC 
TTGGCTCAAGAACCCACTGATTGGACCACATCGAATACCGAGATGTGCGAGGTGGGGCAGGT 
GTGTCAGGAGACGCTGCTGCTCATAGATGTAGGACTCACATCAACCCTGGTGGGGACAAAAG 
GCTGCAGCACTGTTGGGGCTCAAAATTCCCAGAAGACCACCATCCACTCAGCCCCTCCTGGG 
GTGCTTGTGGCCTCCTATACCCACTTCTGCTCCTCGGACCTGTGCAATAGTGCCAGCAGCAG 
CAGCGTTCTGCTGAACTCCCTCCCTCCTCAAGCTGCCCCTGTCCCAGGAGACCGGCAGTGTC 
CTACCTGTGTGCAGCCCCTTGGAACCTGTTCAAGTGGCTCCCCCCGAATGACCTGCCCCAGG 
GGCGCCACTCATTGTTATGATGGGTACATTCATCTCTCAGGAGGTGGGCTGTCCACCAAAAT 
GAGCATTCAGGGCTGCGTGGCCCAACCTTCCAGCTTCTTGTTGAACCACACCAGACAAATCG 
GGATCTTCTCTGCGCGTGAGAAGCGTGATGTGCAGCCTCCTGCCTCTCAGCATGAGGGAGGT 
GGGGCTGAGGGCCTGGAGTCTCTCACTTGGGGGGTGGGGCTGGCACTGGCCCCAGCGCTGTG 
GTGGGGAGTGGTTTGCCCTTCCTGCTAACTCTATTACCCCCACGATTCTTCACCGCTGCTGA 
CCACCCACACTCAACCTCCCTCTGACCTCATAACCTAATGGCCTTGGACACCAGATTCTTTC 
CCATTCTGTCCATGAATCATCTTCCCCACACACAATCATTCATATCTACTCACCTAACAGCA 
ACACTGGGGAGAGCCTGGAGCATCCGGACTTGCCCTATGGGAGAGGGGACGCTGGAGGAGTG 
GCTGCATGTATCTGATAATACAGACCCTGTCCTTTCA 
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MSAVLLLALLGFILPLPGVQALLCQFGTVQHVWKVSDLPRQWTPKNTSCDSGLGCQDTLMLI 
ESGPQVSLVLSKGCTEAKDQEPRVTEHRMGPGLSLISYTFVCRQEDFCNNLVNSLPLWAPQP 
PADPGSLRCPVCLSMEGCLEGTTEEICPKGTTHCYDGLLRLRGGGIFSNLRVQGCMPQPGCN 
LLNGTQEIGPVGMTENCNRKDFLTCHRGTTIMTHGNLAQEPTDWTTSNTEMCEVGQVCQETL 
LLIDVGLTSTLVGTKGCSTVGAQNSQKTTIHSAPPGVLVASYTHFCSSDLCNSASSSSVLLN 
SLPPQAAPVPGDRQCPTCVQPLGTCSSGSPRMTCPRGATHCYDGYIHLSGGGLSTKMSIQGC 
VAQPSSFLIJ^JHTRQIGIFSAREKRDVQPPASQHEGGGAEGLESLTWGVGLT^^ 
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GCGACGGGCAGGACGCCCCGTTCGCCTAGCGCGTGCTCAGGAGTTGGTGTCCTGCCTGCGCT 
CAGGATGAGGGGGAATCTGGCCCTGGTGGGCGTTCTAATCAGCCl^^ 

CCATCTGGACATCCTCAGCCGGCTGGCGATGACGCCTGCTCTGTGCAGATCCTCGTCCCTGG 

CCTCAAAGGGGATGCGGGAGAGAAGGGAGACAAAGGCGCCCCCGGACGGCCTGGAAGA 

GCCCCACGGGAGAAAAAGGAGACATGGGGGACAAAGGACAGAAAGGCAGTGTGGGTCGTCAT 

GGAAAAATTGGTCCCATTGGCTCTAAAGGTGAGAAAGGAGATTCCGGTGACATAGGACCCCC 

TGGTCCTAATGGAGAACCAGGCCTCCCATGTGAGTGCAGCCAGCTGCGCAAGGCCATCGGGG 

AGATGGACAACCAGGTCTCTCAGCTGACCAGCGAGCTCT^GTTCATCAAGAATGCTGTCGCC 

GGTGTGCGCGAGACGGAGAGCAAGATCTACCTGCTGGTGAAGGAGGAGAAGCGCTACGCGGA 

CGCCCAGCTGTCCTGCCAGGGCCGCGGGGGCACGCTGAGCATGCCCAAGGACGAGGCTGCCA 

ATGGCCTGATGGCCGCATACCTGGCGCAAGCCGGCCTGGCCCGTGTCTTCATCGGCATCAAC 

GACCTGGAGAAGGAGGGCGCCTTCGTGTACTCTGACCACTCCCCCATGCGGACCTTCAACAA 

GTGGCGCAGCGGTGAGCCCAACAATGCCTACGACGAGGAGGACTGCGTGGAGATGGTGGCCT 

CGGGCGGCTGGAACGACGTGGCCTGCCACACCACCATGTACTTCATGTGTGAGTTTGACAAG 

GAGAACATGTGAGCCTCAGGCTGGGGCTGCCCATTGGGGGCCCCACATGTCCCTGCAGGGTT 

GGCAGGGACAGAGCCCAGACCATGGTGCCAGCCAGGGAGCTGTCCCTCTGTGAAGGGTGGAG 

GCTCACTGAGTAGAGGGCTGTTGTCTAAACTGAGAAAATGGCCTATGCTTAAGAGGAAAATG 

AAAGTGTTCCTGGGGTGCTGTCTCTGAAGAAGCAGAGTTTCATTACCTGTATTGTAGCCCCA 

ATGTCATTATGTAATTATTACCCAGAATTGCTCTTCCATAAAGCTTGTGCCTTTGTCCAAGC 

TATACAATAAAATCTTTAAGTAGTGCAGTAGTTAAGTCCAAAAAAAAAAAAAAAAAAA 
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MRGNLALVGVLI SLAFLSLLPSGHPQPAGDDACS VQ I LVPGLKGDAGEKGDKGAPGRPGRVG 

PTGEKGDMGDKGQKGSVGRHGKIGPIGSKGEKGDSGDIGPPGPNGEPGLPCECSQLRKAIGE 

MDNQVSQLTSELKFIKNAVAGVRETESKIYLLVKEEKRYADAQLSCQGRGGTLSMPKDEAAN 

GLMAAYLAQAGLARVFIGINDLEKEGAFVYSDHSPMRTFNKWRSGEPNNAYDEEDCVEMVAS 
GGWNDVACHTTMYFMCEFDKENM 
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AGTGACTGCAGCCTTCCTAGATCCCCTCCACTCGGTTTCTCTCTTTGCAGGAGCACCGGCAG 
CACCAGTGTGTGAGGGGAGCAGGCAGCGGTCCTAGCCAGTTCCTTGATCCTGCCAGACCACC 
CAGCCCCCGGCACAGAGCTGCTCCACAGGCACCATGAGGATCATGCTGCTATTCACAGCCAT 
CCTGGCCTTCAGCCTAGCTCAGAGCTTTGGGGCTGTCTGTAAGGAGCCACAGGAGGAGGTGG 
TTCCTGGCGGGGGCCGCAGCAAGAGGGATCCAGATCTCTACCAGCTGCTCCAGAGACTCTTC 
AAAAGCCACTCATCTCTGGAGGGATTGCTCAAAGCCCTGAGCCAGGCTAGCACAGATCCTAA 
GGAATCAACATCTCCCGAGAAACGTGACATGCATGACTTCTTTGTGGGACTTATGGGCAAGA 
GGAGCGTCCAGCCAGAGGGAAAGACAGGACCTTTCTTACCTTCAGTGAGGGTTCCTCGGCCC 
CTTCATCCCAATCAGCTTGGATCCACAGGAAAGTCTTCCCTGGGAACAGAGGAGCAGAGACC 
TTTATAAGACTCTCCTACGGATGTGAATCAAGAGAACGTCCCCAGCTTTGGCATCCTCAAGT 
ATCCCCCGAGAGCAGAATAGGTACTCCACTTCCGGACTCCTGGACTGCATTAGGAAGACCTC 
TTTCCCTGTCCCAATCCCCAGGTGCGCACGCTCCTGTTACCCTTTCTCTTCCCTGTTCTTGT 
AACATTCTTGTGCTTTGACTCCTTCTCCATCTTTTCTACCTGACCCTGGTGTGGAAACTGCA 
TAGTGAATATCCCCAACCCCAATGGGCATTGACTGTAGAATACCCTAGAGTTCCTGTAGTGT 
CCTACATTAAAAATATAATGTCTCTCTCTATTCCTCAACAATAAAGGATTTTTGCATATGAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 2^4 



MRIMLLFTAILAFSLAQSFGAVCKEPQEEWPGGGRSKRDPDLYQLLQRLFKSHSSLEGLLK 

ALSQASTDPKESTSPEKRDMHDFFVGLMGKRSVQPEGKTGPFLPSVRVPRPLHPNQLGSTGK 
SSLGTEEQRPL 
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FIGURE 255 

GGGCGTCTCCGGCTGCTCCTATTGAGCTGTCTGCTCGCTGTGCCCGCTGTGCCTGCTGTGCC 
CGCGCTGTCGCCGCTGCTACCGCGTCTGCTGGACGCGGGAGACGCCAGCGAGCTGGTGATTG 
GAGCCCTGCGGAGAGCTCAAGCGCCCAGCTCTGCCCCAGGAGCCCAGGCTGCCCCGTGAGTC 
CCATAGTTGOTGCAGGAGTGGAGCCATGAGCTGCGTCCTGGGTGGTGTCATCCCCTTGGGGC 
TGCTGTTCCTGGTCTGCGGATCCCAAGGCTACCTCCTGCCCAACGTCACTCTCTTAGAGGAG 
CTGCTCAGCAAATACCAGCACAACGAGTCTCACTCCCGGGTCCGCAGAGCCATCCCCAGGGA 
GGACAAGGAGGAGATCCTCATGCTGCACAACAAGCTTCGGGGCCAGGTGCAGCCTCAGGCCT 
CCAACATGGAGTACATGGTGAGCGCCGGCTCCGGCCGCAGAGGCTGGCACCGGGGGTGGGGC 
CTGGGCCACCAGCCTGCTCTGTTCCCCAGCCAGCTCTGTTCCCCAGCCAGTGCGTGTGATGG 
CTGGCTCAGGGTCTCCTCTGGCAGGGGAGGATCCCGGCTCTGTTCTGTTTTGTTTGTTTGTT 
TTGAGACAGGGTCTCACTCTGCCACTGACGCTGGAGTGCAATGGCACAATCGTCATGCCCTG 
AAACCTTAGACTCCCGGGGTTAAGCGATCCTGCTTCAGCCTCCCAAGTAGCTGGAACTACAG 
GCATGCACCATGGTGCCCAGCTAGATTTTAAATATTTTGTGGAGATGGGGGTCTTGCTACGT 
TGCCCAGGCTGGTCTTGAACTCCTAGGCTCAAGCAATCCTCCTGCCTCAGCCTCTCAAAGTG 
CTAGGATTATAGGCATGAGTCACCCTGTCTGGCTCTGGCTCTGTTCTTAACATTCTGCCAAA 
ACAACACACGTGGGTTCCCTGTGCAGAGCCTGCCTCGTTGCCTTCATGTCACTCTTGGTAGC 
TCCACTGGGAACACAGCTCTCAGCCTTTCCCACCTGGAGGCAGAGTGGGGAGGGGCCCAGGG 
CTGGGCTTTGCTGATGCTGATCTCAGCTGTGCCACACGCTAGCTGCACCACCCTGACTTCTC 
CTTAGCCCGTGTGAGCCTCACTTTCCACTTGGAGAGTCCTTCCTCGCGTGGTTGCCATGACT 
GTGAGATAAGTCGAGGCTGTGAAGGGCCCGGCACAGACTGACCTGCCTCCCCAACCCCTAGG 
CTTTGCTAACCGGGAAAGGAGCTAACGGTGACAGAAGACAGCCAAGGTCAACCCTCCCGGGT 
GATTGTGATGGGTGTTCCAGGTGTGGTTGGGCGATGCTGCTACTTGACCCCAAGCTCCAGTG 
TGGAAACTTCCTTCCTGGCTGGTTTTCCAGAACTACAGAGGAATGGACCACAGTCTTCCAGG 
GTCCCTCCTCGTCCACCAACCGGGAGCCTCCACCTTGGCCATCCGTCAGCTATGAATGGCTT 
TTTAAACAAACCCACGTCCCAGCCTGGGTAACATGGTAAAGCCCCGTCTCTACAAAAAAATC 
CAAGTTAGCCGGGCATGGTGGTGCGCACCTGTAGTCCCAGCTGCAGTGGGACTGAGGTGGAG 
GTGGAGGTGGGGGGTGGGAGCTGAGGAAGGAGGATCGCTTGAGCCTGGGAAGTCGAGGCTGC 
AGTGAGCTGAGATTGCACCACTGCACTCCAGCCTGGGTGACAGAGCAAGACCCTGTCTCAAAAA 
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FIGURE 256 

MSCVLGGVIPLGLLFLVCGSQGYLLPNVTLLEELLSKYQHNESHSRVRRAIPREDKEEILML 
HNKLRGQVQPQASNMEYMVSAGSGRRGWHRGWGLGHQPALFPSQLCSPASACDGWLRVSSGR 
GGSRLCSVLFVCFETGSHSATDAGVQWHNRHALKP 
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FIGURE 257 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCAC TATGG G 

GTCTGGGCTGCCCCTTGTCCrCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGG 

GTATGACTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGC 

TTCCTGGAATTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCAC 

CCTCCACCATGCAAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAG^ 

TGTCCTTCTTGGCCCGGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTT 

CAGCAGGCCCCCACCCTCCTGAGTGGCAATAT^TAAAATTCGGTATGCTG 
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FIGURE 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTS 
VTLHHARSQHHWCNT 
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FIGURE 259 

AATTGTATCTGTGTAATGTTAAAACAAACGAAATAAAATAGAAGGAAAAACTTTCTGAGTTT 

CAAAAACT^CAGACTAGTACTCTAAAGAACTCTTTAAAACAATTAACTGTTAGGATTGC^ 

TATCATTGGATATTATTTAATTCTGTTTCTGATGTGGGGTTCCTCCACTGTGTTCTGTGTGC 

TATTAATATTTACCATTGCAGAAGCTTCATTCAGTGTTGAAAATGAATGCTTAGTGGATCTG 

TGCCTCTTACGCATATGTTACAAATTATCTGGAGTTCCTAATCAATGCAGAGTTCCCCTCCC 

CTCCGATTGTTCTAAATAATTGAAAGATGTCTGCTGTGGAAAAAGGCATGTATTTAAATCTG 

TATGATTCTCAACCATCTTTAGTTGGGAAAGGTCCTTGAAAGCCAATGGAAATACTTTTTTT 

TTTTCTTGGCACTAATC/^GTGAGTGTTACCTTTTCACTTAGTAGGATGTGTTGTTACGCTA 

GTAAAATAGAAACCTGTGTTTATTCTCAGGTATTTTAGAAACAACAGCCATCATTTTATTTT 

ATGTGTGTGTTCTTGGCTGTATTCATAAATTATATATTTTGGGCTATCAAATATTACTTCAT 

TCAATATAAATAACAATAGTAGAAGTTGTTTACTTAGATATGCTTTCTAGTTGCATTTTCTC 

AGCCTATGTAAGACTACTTTGTTGTAATAGCCTTTGAAATTTACAGTACTGTCTCTCTACTA 

TCTTCAGATTACTTGATTCAAATAAACCAATTATGTTTGTAATTGATATTAATAAAACCAGA 

ATAAAAGTTCATATCTACCC 
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FIGURE 260 

MI GYYL I LFLMWGSSTVFCVLLI FT I AEAS PS VENECLVDLCLLR I CYKLSGVPNQCRVPLP 
SDCSK 
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GAGGATTTGCCACAGCAGCGCATAGAGCAGGAGAGCACCACCGGAGCCCTTGAGACATCCTT 
GAGAAGAGCCACAGCATAAGAGACTGCCCTGCTTGGTGTTTTGCAGGATGATGGTGGCCCTT 
CGAGGAGCTTCTGCATTGCTGGTTCTGTTCCTTGCAGCTTTTCTGCCCCCGCCGCAGTGTAC 
CCAGGACCCAGCCATGGTGCATTACATCTACCAGCGCTTTCGAGTCTTGGAGCAAGGGCTGG 
AAAAATGTACCCAAGCAACGAGGGCATACATTCAAGAATTCCAAGAGTTCTCAAAAAATATA 
TCTGTCATGCTGGGAAGATGTCAGACCTACACAAGTGAGTACAAGAGTGCAGTGGGTAACTT 
GGCACTGAGAGTTGAACGTGCCCAACGGGAGATTGACTACATACAATACCTTCGAGAGGCTG 
ACGAGTGCATCGTATCAGAGGACAAGACACTGGCAGAAATGTTGCTCCAAGAAGCTGAAGAA 
GAGAAAAAGATCCGGACTCTGCTGAATGCAAGCTGTGACAACATGCTGATGGGCATAAAGTC 
TTTGAAAATAGTGAAGAAGATGATGGACACACATGGCTCTTGGATGAAAGATGCTGTCTATA 
ACTCTCCAAAGGTGTACTTATTAATTGGATCCAGAAACAACACTGTTTGGGAATTTGCAAAC 
ATACGGGCATTCATGGAGGATAACACCAAGCCAGCTCCCCGGAAGCAAATCCTAACACTTTC 
CTGGCAGGGAACAGGCCAAGTGATCTACAAAGGTTTTCTATTTTTTCATAACCAAGCAACTT 
CTAATGAGATAATCAAATATAACCTGCAGAAGAGGACTGTGGAAGATCGAATGCTGCTCCCA 
GGAGGGGTAGGCCGAGCATTGGTTTACCAGCACTCCCCCTCAACTTACATTGACCTGGCTGT 
GGATGAGCATGGGCTCTGGGCCATCCACTCTGGGCCAGGCACCCATAGCCATTTGGTTCTCA 
CAAAGATTGAGCCGGGCACACTGGGAGTGGAGCATTCATGGGATACCCCATGCAGAAGCCAG 
GATGCTGAAGCCTCATTCCTCTTGTGTGGGGTTCTCTATGTGGTCTACAGTACTGGGGGCCA 
GGGCCCTCATCGCATCACCTGCATCTATGATCCACTGGGCACTATCAGTGAGGAGGACTTGC 
CCAACTTGTTCTTCCCCAAGAGACCAAGAAGTCACTCCATGATCCATTACAACCCCAGAGAT 
AAGCAGCTCTATGCCTGGAATGAAGGAAACCAGATCATTTACAAACTCCAGACAAAGAGAAA 
GCTGCCTCTGAAGTAATGCATTACAGCTGTGAGAAAGAGCACTGTGGCTTTGGCAGCTGTTC 
TACAGGACAGTGAGGCTATAGCCCCTTCACAATATAGTATCCCTCTAATCACACACAGGAAG 
AGTGTGTAGAAGTGGAAATACGTATGCCTCCTTTCCCAAATGTCACTGCCTTAGGTATCTTC 
CAAGAGCTTAGATGAGAGCATATCATCAGGAAAGTTTCAACAATGTCCATTACTCCCCCAAA 
CCTCCTGGCTCTCAAGGATGACCACATTCTGATACAGCCTACTTCAAGCCTTTTGTTTTACT 
GCTCCCCAGCATTTACTGTAACTCTGCCATCTTCCCTCCCACAATTAGAGTTGTATGCCAGC 
CCCTAATATTCACCACTGGCTTTTCTCTCCCCTGGCCTTTGCTGAAGCTCTTCCCTCTTTTT 
CAT^TGTCTATTGATATTCTCCCATTTTCACTGCCCAACTAAAATACTATTAATATTTCTTT 
CTTTTCTTTTCTTTTTTTTGAGACAAGGTCTCACTATGTTGCCCAGGCTGGTCTCAAACTCC 
AGAGCTCAAGAGATCCTCCTGCCTCAGCCTCCTAAGTACCTGGGATTACAGGCATGTGCCAC 
CACACCTGGCTTAAAATACTATTTCTTATTGAGGTTTAACCTCTATTTCCCCTAGCCCTGTC 
CTTCCACTAAGCTTGGTAGATGTAATAATAAAGTGAAAATATTAACATTTGT^TATCGCTTT 
CCAGGTGTGGAGTGTTTGCACATCATTGAATTCTCGTTTCACCTTTGTGAAACATGCACAAG 
TCTTTACAGCTGTCATTCTAGAGTTTAGGTGAGTAACACAATTACT^AAGTGAAAGATACAGC 
TAGAAAATACTACAAATCCCATAGTTTTTCCATTGCCCAAGGAAGCATCAAATACGTATGTT 
TGTTCACCTACTCTTATAGTCAATGCGTTCATCGTTTCAGCCTAAAAATAATAGTCTGTCCC 
TTTAGCCAGTTTTCATGTCTGCACAAGACCTTTCAATAGGCCTTTCAAATGATAATTCCTCC 
AGAAAACCAGTCTAAGGGTGAGGACCCCAACTCTAGCCTCCTCTTGTCTTGCTGTCCTCTGT 
TTCTCTCTTTCTGCTTTAAATTCAATAAAAGTGACACTGAGCAAAAAAAAAAAAAAA 
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MMVALRGASALLVLFLAAFLPPPQCTQDPAMVHYIYQRFRVLEQGLEKCTQATRAYIQEFQE 

PSKNISVMLGRCQTYTSEYKSAVGNLALRVERAQREIDyiQYLREADECIVSEDKTLAEMLL 

QEAEEEKKIRTLIJIASCDNMIJyiGIKSLKIVKKMMDTHGSWMKDAVYNSPKVYLLIGSRNNTV 

WEFANIRAFMEDNTKPAPRKQILTLSWQGTGQVIYKGFLFFHNQATSNEIIKYNLQKRTVED 

RMLLPGGVGRALVYQHSPSTYIDLAVDEHGLWAIHSGPGTHSHLVLTKIEPGTLGVEHSWDT 

PCRSQDAEASFLLCGVLYWYSTGGQGPHRITCIYDPLGTISEEDLPNLFFPKRPRSHSMIH 
YNPRDKQLYAWNEGNQ I I YKLQTKRKLPLK 
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GGGCGCCCGCGTACTC?^CTAGCTGAGGTGGC7^GTGGTTCCACCAACATGGAGCTCTCGCAGA 
TGTCGGAGCTCATGGGGCTGTCGGTGTTGCTTGGGCTGCTGGCCCTGATGGCGACGGCGGCG 
GTAGCGCGGGGGTGGCTGCGCGCGGGGGAGGAGAGGAGCGGCCGGCCCGCCTGCCAAAAAGC 
AAATGGATTTCCACCTGACAAATCTTCGGGATCCAAGAAGCAGAAACAATATCAGCGGATTC 
GGAAGGAGAAGCCTCAACAACACAACTTCACCCACCGCCTCCTGGCTGCAGCTCTGAAGAGC 
CACAGCGGGAACATATCTTGCATGGACTTTAGCAGCAATGGCAAATACCTGGCTACCTGTGC 
AGATGATCGCACCATCCGCATCTGGAGCACCAAGGACTTCCTGCAGCGAGAGCACCGCAGCA 
TGAGAGCCAACGTGGAGCTGGACCACGCCACCCTGGTGCGCTTCAGCCCTGACTGCAGAGCC 
TTCATCGTCTGGCTGGCCAACGGGGACACCCTCCGTGTCTTCAAGATGACCAAGCGGGAGGA 
TGGGGGCTACACCTTCACAGCCACCCCAGAGGACTTCCCTAAAAAGCACAAGGCGCCTGTCA 
TCGACATTGGCATTGCTAACACAGGGAAGTTTATCATGACTGCCTCCAGTGACACCACTGTC 
CTCATCTGGAGCCTGAAGGGTCAAGTGCTGTCTACCATCAACACCAACCAGATGAACAACAC 
ACACGCTGCTGTATCTCCCTGTGGCAGATTTGTAGCCTCGTGTGGCTTCACCCCAGATGTGA 
AGGTTTGGGAAGTCTGCTTTGGAAAGAAGGGGGAGTTCCAGGAGGTGGTGCGAGCCTTCGAA 
CTAAAGGGCCACTCCGCGGCTGTGCACTCGTTTGCTTTCTCCAACGACTCACGGAGGATGGC 
TTCTGTCTCCAAGGATGGTACATGGAAACTGTGGGACACAGATGTGGAATACAAGAAGAAGC 
AGGACCCCTACTTGCTGAAGACAGGCCGCTTTGAAGAGGCGGCGGGTGCCGCGCCGTGCCGC 
CTGGCCCTCTCCCCCAACGCCCAGGTCTTGGCCTTGGCCAGTGGCAGTAGTATTCATCTCTA 
CAATACCCGGCGGGGCGAGAAGGAGGAGTGCTTTGAGCGGGTCCATGGCGAGTGTATCGCCA 
ACTTGTCCTTTGACATCACTGGCCGCTTTCTGGCCTCCTGTGGGGACCGGGCGGTGCGGCTG 
TTTCACAACACTCCTGGCCACCGAGCCATGGTGGAGGAGATGCAGGGCCACCTGAAGCGGGC 
CTCCAACGAGAGCACCCGCCAGAGGCTGCAGCAGCAGCTGACCCAGGCCCAAGAGACCCTGA 
AGAGCCTGGGTGCCCTGAAGAAGTGACTCTGGGAGGGCCCGGCGCAGAGGATTGAGGAGGAG 
GGATCTGGCCTCCTCATGGCACTGCTGCCATCTTTCCTCCCAGGTGGAAGCCTTTCAGAAGG 
AGTCTCCTGGTTTTCTTACTGGTGGCCCTGCTTCTTCCCATTGAAACTACTCTTGTCTACTT 
AGGTCTCTCTCTTCTTGCTGGCTGTGACTCCTCCCTGACTAGTGGCCAAGGTGCTTTTCTTC 
CTCCCAGGCCCAGTGGGTGGAATCTGTCCCCACCTGGCACTGAGGAGAATGGTAGAGAGGAG 
AGGAGAGAGAGAGAGAATGTGATTTTTGGCCTTGTGGCAGCACATCCTCACACCCAAAGAAG 
TTTGTAAATGTTCCAGAACAACCTAGAGAACACCTGAGTACTAAGCAGCAGTTTTGCAAGGA 
TGGGAGACTGGGATAGCTTCCCATCACAGAACTGTGTTCCATCAAAAAGACACTAAGGGATT 
TCCTTCTGGGCCTCAGTTCTATTTGTAAGATGGAGAATAATCCTCTCTGTGAACTCCTTGCA 
AAGATGATATGAGGCTAAGAGAATATCAAGTCCCCAGGTCTGGAAGAAAAGTAGAAAAGAGT 
AGTACTATTGTCCAATGTCATGAAAGTGGTAAAAGTGGGAACCAGTGTGCTTTGAAACCAAA 
TTAGAAACACATTCCTTGGGAAGGCAAAGTTTTCTGGGACTTGATCATACATTTTATATGGT 
TGGGACTTCTCTCTTCGGGAGATGATATCTTGTTTAAGGAGACCTCTTTTCAGTTCATCAAG 
TTCATCAGATATTTGAGTGCCCACTCTGTGCCCAAATAAATATGAGCTGGGGATTAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA7VAAAAAAAA 
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FIGURE 264 

MELSQMSELMGLSVLLGLL^MATAAVARGWLRAGEERSGRPACQKANGFPPDKSSGSKKQK 

QYQRIRKEKPQQHNFTHRLLAAALKSHSGNISCMDFSSNGKYLATCADDRTIRIWSTKDFLQ 

REHRSMRANVELDHATLWFSPDCRAFIWLANGDTLRVFKMTKREDGGYTFTATPEDFPKK 

HKAPVIDIGIANTGKFIMTASSDTTVLIWSLKGQVLSTINTNQMNNTHAAVSPCGRFVASCG 

FTPDVKWEVCFGKKGEFQEWRAFELKGHSAAVHSFAFSNDSRRI^^VSKDGTWKLWDTDV 

EYKKKQDPYLLKTGRFEEAAGAAPCRLALSPNAQVLALASGSSIHLYNTRRGEKEECFERVH 

GECIANLSFDITGRFIASCGDRAVRLFHNTPGHRAMVEEMQGHLKRASNESTRQRLQQQLTQ 
AQETLKSLGALKK 
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FIGURE 265 

TGGCCTCCCC?VGCTTGCCAGGCACAAGGCTGAGCGGGAGGAAGCGAGAGGCATCTAAGCAGG 
CAGTGTTTTGCCTTCACCCCAAGTGAC CATGA GAGGTGCCACGCGAGTCTCAATCATGCTCC 
TCCTAGTAACTGTGTCTGACTGTGCTGTGATCACAGGGGCCTGTGAGCGGGATGTCCAGTGT 
GGGGCAGGCACCTGCTGTGCCATCAGCCTGTGGCTTCGAGGGCTGCGGATGTGCACCCCGCT 
GGGGCGGGAAGGCGAGGAGTGCCACCCCGGCAGCCACAAGGTCCCCTTCTTCAGGAAACGCA 
AGCACCACACCTGTCCTTGCTTGCCCAACCTGCTGTGCTCCAGGTTCCCGGACGGCAGGTAC 
CGCTGCTCCATGGACTTGAAGAACATCAATTTTTAGGCGCTTGCCTGGTCTCAGGATACCCA 
CCATCCTTTTCCTGAGCACAGCCTGGATTTTTATTTCTGCCATGAAACCCAGCTCCCATGAC 
TCTCCCAGTCCCTACACTGACTACCCTGATCTCTCTTGTCTAGTACGCACATATGCACACAG 
GCAGACATACCTCCCATCATGACATGGTCCCCAGGCTGGCCTGAGGATGTCACAGCTTGAGG 
CTGTGGTGTGAAAGGTGGCCAGCCTGGTTCTCTTCCCTGCTCAGGCTGCCAGAGAGGTGGTA 
AATGGCAGAAAGGACATTCCCCCTCCCCTCCCCAGGTGACCTGCTCTCTTTCCTGGGCCCTG 
CCCCTCTCCCCACATGTATCCCTCGGTCTGAATTAGACATTCCTGGGCACAGGCTCTTGGGT 
GCATTGCTCAGAGTCCCAGGTCCTGGCCTGACCCTCAGGCCCTTCACGTGAGGTCTGTGAGG 
ACCAATTTGTGGGTAGTTCATCTTCCCTCGATTGGTTAACTCCTTAGTTTCAGACCACAGAC 
TCAAGATTGGCTCTTCCCAGAGGGCAGCAGACAGTCACCCCAAGGCAGGTGTAGGGAGCCCA 
GGGAGGCCAATCAGCCCCCTGAAGACTCTGGTCCCAGTCAGCCTGTGGCTTGTGGCCTGTGA 
CCTGTGACCTTCTGCCAGAATTGTCATGCCTCTGAGGCCCCCTCTTACCACACTTTACCAGT 
TAACCACTGAAGCCCCCAATTCCCACAGCTTTTCCATTAAAATGCAAATGGTGGTGGTTCAA 
TCTAATCTGATATTGACATATTAGAAGGCAATTAGGGTGTTTCCTTAAACAACTCCTTTCCA 
AGGATCAGCCCTGAGAGCAGGTTGGTGACTTTGAGGAGGGCAGTCCTCTGTCCAGATTGGGG 
TGGGAGCAAGGGACAGGGAGCAGGGCAGGGGCTGAAAGGGGCACTGATTCAGACCAGGGAGG 
CAACTACACACCAACATGCTGGCTTTAGAATAAAAGCACCAACTGAAA/^ 
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FIGURE 266 

MRGATRVSIMLLLVTVSDCAVITGACERDVQCGAGTCCAISLWLRGLRMCTPLGREGEECHP 
GSHKVPFFRKRKHHTCPCLPNLLCSRFPDGRYRCSMDLKNINF 
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FIGURE 267 

AGCGCCCGGGCGTCGGGGCGGTAAAAGGCCGGCAGAAGGGAGGCACTTGAGAAATGTCTTTC 
CTCCAGGACCCAAGTTTCTTCACCATGGGGATGTGGTCCATTGGTGCAGGAGCCCTGGGGGC 
TGCTGCCTTGGCATTGCTGCnTGCCAACACAGACGTGTTTCTGTCCAAGCCCCAGAAAGCGG 
CCCTGGAGTACCTGGAGGATATAGACCTGAAAACACTGGAGAAGGAACCAAGGACTTTCAAA 
GCAAAGGAGCTATGGGAAAAAAATGGAGCTGTGATTATGGCCGTGCGGAGGCCAGGCTGTTT 
CCTCTGTCGAGAGGAAGCTGCGGATCTGTCCTCCCTGAAAAGCATGTTGGACCAGCTGGGCG 
TCCCCCTCTATGCAGTGGTAAAGGAGCACATCAGGACTGAAGTGAAGGATTTCCAGCCTTAT 
TTCAT^GGAGAAATCTTCCTGGATGAAA^VGAAAAAGTTCTATGGTCCACAAAGGCGGAAGAT 
GATGTTTATGGGATTTATCCGTCTGGGAGTGTGGTACAACTTCTTCCGAGCCTGGAACGGAG 
GCTTCTCTGGAAACCTGGAAGGAGAAGGCTTCATCCTTGGGGGAGTTTTCGTGGTGGGATCA 
GGAAAGCAGGGCATTCTTCTTGAGCACCGAGAAAAAGAATTTGGAGACAAAGTAAACCTACT 
TTCTGTTCTGGAAGCTGCTT^GATGATCAAACCACAGACTTTGGCCTCAGAGAAAAAATGAT 
TGTGTGAAACTGCCCAGCTCAGGGATAACCAGGGACATTCACCTGTGTTCATGGGATGTATT 
GTTTCCACTCGTGTCCCTAAGGAGTGAGAAACCCATTTATACTCTACTCTCAGTATGGATTA 
TTAATGTATTTTAATATTCTGTTTAGGCCCACTAAGGCAAAATAGCCCCAAAACAAGACTGA 
C?^AAAATCTGAAAAACTAATGAGGATTATTAAGCTAAAACCTGGGAAATAGGAGGCTTAAAA 
TTGACTGCCAGGCTGGGTGCAGTGGCTCACACCTGTAATCCCAGCACTTTGGGAGGCCAAGG 
TGAGCAAGTCACTTGAGGTCGGGAGTTCGAGACCAGCCTGAGCAACATGGCGAAACCCCGTC 
TCTACTAAAAATACAAAAATCACCCGGGTGTGGTGGCAGGCACCTGTAGTCCCAGCTACCCG 
GGAGGCTGAGGCAGGAGAATCACTTGAACCTGGGAGGTGGAGGTTGCGGTGAGCTGAGATCA 
CACCACTGTATTCCAGCCTGGGTGACTGAGACTCTAACTAA 
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FIGURE 268 

MSFLQDPSFFTMGMWSIGAGALGAAALALLLANTDVFLSKPQKAALEYLEDIDLKTLEKEPR 
TFKAKELVffiKNGAVIMA\nUlPGCFLCREEAADLSSLKSMLDQLGVPLYAVVKEHIRTEVKDF 
QPYFKGEIFLDEKKKFYGPQRRKMMFMGFIRLGVWYNFFRAWNGGFSGNLEGEGFILGGVFV 
VGSGKQGILLEHREKEFGDKVNLLSVLEAAKMIKPQTLASEKK 
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FIGURE 269 

ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCG 

GGCCAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGA 

AGCCCCTTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGPTG 

CTTCTGGCGCTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACA 

GACCACITCTGCy^TGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCT 

GCAACCTGCGTCCGGAAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTG 

CTCCTGGCTGTGGGGCTGGCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGG 

CACCTACCGGCCCAGTAGCGAGGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGG 

ACTCCAAGGAGACGGTGCAGGGCTGCCTGCCCATCIASGTCCCCTCTCCTGCATCTGTCTCC 

CTTCATTGCTGTGTGACCTTGGGGAAAGGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAG 

TGCTTAATAGCAGGGAAGAAGGTACTTCAAAGACTCTGCCCCTGAGGTCAAGAGAGGATGGG 

GCTATTCACTTTTATATATTTATATAAAATTAGTAGTGAGATGTAAAAAAAAAAAAAAAAAA 
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FIGURE 270 



MANPGLGLLLALGLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIV 
WSLLT^LIAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAT^EARAPQDSK^ 
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AATATATCATCTATTTATCATTAATCAATAATGTATTCTTTTATTCCAATAACATTTGGGTT 
TTGGGATTTTAATTTTCAAACACAGCAGA ATGA CATTTTTTCTGTCACTATTATTATTGTTG 
GTATGTGAAGCTATTTGGAGATCCAATTCAGGAAGCAACACATTGGAGAATGGCTACTTTCT 
ATCAAGJ\AATAAAGAGAACCACAGTCAACCCACACAATCATCTTTAGAAGACAGTGTGACTC 
CTACOy^GCTGTCAAAACCACAGGCAAGGGCATAGTTAAAGGACGGAATCTTGACTCAAGA 
GGGTTAATTCTTGGTGCTGAAGCCTGGGGCAGGGGTGTAAAGAAAAACACTTAGATTCAATG 
ATTGTAAATTTAAGGCAAATACACATATTAGTATTACCTTAGTGTAATGTATCCCTGTCATA 
TATACAATAAGGTGATU^TTATAAGTACCOTATGCAGTTGGCTGGACAGTTCTAAATTGGACT 
TTATTAATTTTTAAAATCAGTAACTGATTTATCACTGGCTATGTGCTTAGATCTACAGGAGA 
TCATATAATTTGATACAAATAAAAGAAAAGTGTTCTCTCCCCTTACAGAATTGACATTTTAA 
ATGCGATACAGTTAGAATAGGAAATATGACATTAGAAAGGAAGAATGACAGGGAGAAAGGAA 
AGAAGGGAAAATGTTGCCAAGGAAAAAAAAA 
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FIGURE 272 

MTFFLSLLLLLVCEAIWRSNSGSNTLENGYFLSRNKENHSQPTQSSLEDSVTPTKAVKTTGK 
GIVKGRNLDSRGLILGAEAWGRGVKKNT 
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FIGURE 273 

GCCAGGAATAACTAGAGAGGT^CAATGGGGTTATTCAGAGGTTTTGTTTTCCTCTTAGTTCT 

GTGCCTGCTGCACCAGTCAAATACTTCCTTCATTAAGCTGAATAATAATGGCTTTGAAGATA 

TTGTCATTGTTATAGATCCTAGTGTGCCAGAAGATGAAAAAATAATTGAACAAATAGAGGAT 

ATGGTGACTACAGCTTCTACGTACCTGTTTGAAGCCACAGAAAAAAGATTTTTTTTCAAAAA 

TGTATCTATATTAATTCCTGAGAATTGGAAGGAAAATCCTCAGTACAAAAGGCCAAAACATG 

AAAACCATAAACATGCTGATGTTATAGTTGCACCACCTACACTCCCAGGTAGAGATGAACCA 

TACACCAAGCAGTTava^GAATGTGGAGAGAAAGGCGAATAO^TTCACrTCACCCCT 

TCTACTTGGAAAAAAACAAAATGAATATGGACCACCAGGCAAACTGTTTGTCCATGAGTGGG 

CTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTCTACCGTGCTAAG 

TCAAAAAAAATCGAAGCAACAAGGTGTTCCGCAGGTATCTCTGGTAGAAATAGAGTTTATAA 

GTGTCAAGGAGGCAGCTGTCTTAGTAGAGCATGCAGAATTGATTCTACAACAAAACTGTATG 

GAAAAGATTGTCAATTCTTTCCTGATAAAGTACAAACAGAAAAAGCATCCATAATGTTTATG 

CAAAGTATTGATTCTGTTGTTGAATTTTCTAACGAAAAAACCCATAATCAAGAAGCTCCAAG 

CCTACAAAACATAAAGTGCAATTTTAGAAGTACATGGGAGGTGATTAGCAATTCTGAGGATT 

TTAAAAACACCATACCCATGGTGACACCACCTCCTCCACCTGTCTTCTCATTGCTG/U^GATC 

AGTCAAAGAATTGTGTGCTTAGTTCTTGATAAGTCTGGAAGCATGGGGGGTAAGGACCGCCT 

AAATCGAATGAATCAAGCAGCAAAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCTGGG 

TGGGGATGGTTCACTTTGATAGTACTGCCACTATTGTAAATAAGCTAATCCAAATAAAAAGC 

AGTGATGAAAGAAACACACTCATGGCAGGATTACCTACATATCCTCTGGGAGGAACTTCCAT 

CTGCTCTGGAATTAAATATGCATTTCAGGTGATTGGAGAGCTACATTCCCAACTCGATGGAT 

CCGAAGTACTGCTGCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTATTGATGAAGTG 

AAACAAAGTGGGGCCATTGTTCATTTTATTGCTTTGGGAAGAGCTGCTGATGAAGCAGTAAT 

AGAGATGAGCAAGATAACAGGAGGAAGTCATTTTTATGTTTCAGATGAAGCTCAGAACAATG 

GCCTCATTGATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCTCCCAGTVAGTCCCTT 

CAGCTCGAAAGTAAGGGATTAACACTGAATAGTAATGCCTGGATGAACGACACTGTCATAAT 

TGATAGTACAGTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTA 

TTTCTCTCTGGGATCCCAGTGGAACAATAATGGAAAATTTCACAGTGGATGCAACTTCCAAA 

ATGGCCTATCTCAGTATTCCAGGAACTGCAAAGGTGGGCACTTGGGCATACAATCTTCAAGC 

CAAAGCGAACCCAGAAACATTAACTATTACAGTAACTTCTCGAGCAGCAAATTCTTCTGTGC 

CTCCAATCACAGTGAATGCTAAAATGAATAAGGACGTAAACAGTTTCCCCAGCCCAATGATT 

GTTTACGCAGAAATTCTACAAGGATATGTACCTGTTCTTGGAGCCAATGTGACTGCTTTCAT 

TGAATCACAGAATGGACATACAGAAGTTTTGGAACTTTTGGATAATGGTGCAGGCGCTGATT 

CTTTCAAGAATGATGGAGTCTACTCCAGGTATTTTACAGCATATACAGAAAATGGCAGATAT 

AGCTTAAAAGTTCGGGCTCATGGAGGAGCAAACACTGCCAGGCTAAAATTACGGCCTCCACT 

GAATAGAGCCGCGTACATACCAGGCTGGGTAGTGAACGGGGAAATTGAAGCAAACCCGCCAA 

GACCTGAAATTGATGAGGATACTCAGACCACCTTGGAGGATTTCAGCCGAACAGCATCCGGA 

GGTGCATTTGTGGTATCACAAGTCCCAAGCCTTCCCTTGCCTGACCAATACCCACCAAGTCA 

AATCACAGACCTTGATGCCACAGTTCATGAGGATAAGATTATTCTTACATGGACAGCACCAG 

GAGATAATTTTGATGTTGGAAAAGTTCAACGTTATATCATAAGAATAAGTGCAAGTATTCTT 

GATCTAAGAGACAGTTTTGATGATGCTCTTCAAGTAAATACTACTGATCTGTCACCAAAGGA 

GGCCAACTCCAAGGAAAGCTTTGCATTTAAACCAGAAAATATCTCAGAAGAAAATGCAACCC 

ACATATTTATTGCCATTAAAAGTATAGATAAAAGCAATTTGACATCT^AAAGTATCCAACATT 

GCACAAGTAACTTTGTTTATCCCTCAAGCA/U^TCCTGATGACATTGATCCTACACCTACTCC 

TACTCCTACTCCTACTCCTGATAAAAGTCATAATTCTGGAGTTAATATTTCTACGCTGGTAT 

TGTCTGTGATTGGGTCTGTTGTAATTGTTAACTTTATTTTAAGTACCACCATTTGAACCTTA 

ACGAAGAAAAAAATCTTCAAGTAGACCTAGAAGAGAGTTTTAAAAAACAAAACAATGTAAGT 

AAAGGATATTTCTGAATCTTAAAATTCATCCCATGTGTGATCATAAACTCATAAAAATAATT 

TTAAGATGTCGGAAAAGGATACTTTGATTAAATAAAAACACTCATGGATATGTAAAAACTGT 

CAAGATTAAAATTTAATAGTTTCATTTATTTGTTATTTTATTTGTAAGAAATAGTGATGAAC 

AAAGATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGATGCAACAGTTTTCTGAAAT 

GATATTTCAAATTGCATCAAGAAATTAAAATCATCTATCTGAGTAGTCAAAATACAAGTAAA 

GGAGAGCAAATAAACAACATTTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAT^AAAAAAAAAAAAAAAAAAAAA 



BNSOOCtO: <WO_ 9963088A2 I > 



wo 99/63088 _o / PCTAJS99/12252 

FIGURE 274 

MGLFRGFVFLLVLCLLHQSNTSFIKUONNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTY 

LFEATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTEC 

GEKGEYIHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATR 

CSAGISGRNRVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSWE 

FCNEKTHNQEAPSLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLV 

LDKSGSMGGKDRLNRMNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLM 

AGLPTYPLGGTSICSGIKYAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVH 

FIALGRAADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLT 

LNSNAWMNDTVI IDSTVGKDTFFLI TWNSLPPS I SLWDPSGT IMENFTVDATSKMAYLS I PG 

TAKVGTWAYNLQAKANPETLTITVTSRAANSSVPP I TVNAKMNKDVNS FPSPMI VYAE I LQG 

YVPVLGANVTAFIESQNGHTEVLELLDNGAGADSFKNDGVYSRYFTAYTENGRYSLKVRAHG 

GANTARLKLRPPLNRAAYIPGWWNGEIEANPPRPEIDEDTQTTLEDFSRTASGGAFWSQV 

PSLPLPDQYPPSQITDLDATVHEDKI ILTWTAPGDNFDVGKVQRYI IRISASILDLRDSFDD 

ALQVNTTDLSPKEANSKESFAFKPENISEENATHIFIAIKSIDKSNLTSKVSNIAQVTLFIP 

QANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWIVNFILSTTI 
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FIGURE 275 

CTCCTTAGGTGGAAACCCTGGGAGTAGAGTACTGACAGCAAAGACCGGGAAAGACCATACGTCCCCGG 

GCAGGGGTGACAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGA^ 

GCC/AGGTAATTTTGACCCAGAGGAGCAATGATGTAGCaVCCrCCTAACOT 

AGTTATGCCAGGATTTACTAGAGAGTGTCAACTCAACCAGCAAGCGGCTCCTTCGGCTTT^CTT 

TTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCAGTGCrCAGAAGTGACITGCCTGAG^ 

GACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAAGGCTGTGATGGGAATGAA 

GGTGAAAACTTGGAGATTTCACTTCAGTCATTGCTTCTGCCTGCAAGATCATCCTTTAAA^ 

AGCTGCTCTGTGTGGTGGTTAACTCCAAGAGGCAGAACTCGTTCTAGAAGGAAATGGATGCAAGCAGC 

TCCGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCT 

TTGAGGGATGCCACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCT 

TGCGTGGATTTCCCGGGTGGTGGTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGT 

TGGCCTGCACCCCAAAAGGTGACGAGGAGCAGCTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAG 

GAGGGGTACC7VGGCCGTCCTTCAGGAGTGGGAGGAGCAGCACCGCAACTACGTGAGCAGCCTGAAGCG 

GCAGATCGCACAGCTCAAGGAGGAGCTGCAGGAGAGGAGTGAGCAGCTCAGGAATGGGCAGTACCAAG 

CCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAA/^CCCAGGCCGACCTCCTGGCC 

TTCCTGCACTCGCAGGTGGACAAGGCAGAGGTGAATGCTGGCGTCAAGCTGGCCACAGAGTATGCAGC 

AGTGCCTTTCGATAGCTTTACTCTACAGAAGGTGTACCAGCTGGAGACTGGCCTTACCCGCCACCCCG 

AGGAGAAGCCTGTGAGGAAGGACAAGCGGGATGAGTTGGTGGAAGCCATTGAATCAGCCTTGGAGACC 

CTGAACAATCCTGCAGAGAACAGCCCCAATCACCGTCCTTACACGGCCTCTGATTTCATAGAAGGGAT 

CTACCGAACAGAAAGGGACAAAGGGACATTGTATGAGCTCACCTTCAAAGGGGACCACAAACACGAAT 

TCAAACGGCTCATCTTATTTCGACCATTCAGCCCCATCATGAAAGTGAAAAATGAAAAGCTCAACATG 

GCCAACACGCTTATCAATGTTATCGTGCCTCTAGCAAAAAGGGTGGACAAGTTCCGGCAGTTCATGCA 

GAATTTCAGGGAGATGTGCATTGAGCAGGATGGGAGAGTCCATCTCACTGTTGTTTACTTTGGGAAAG 

AAGAAATAAATGAAGTCAAAGGAATACTTGAAAACACTTCCAAAGCTGCCAACTTCAGGAACTTTACC 

TTCATCCAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGG 

AAGCAACGTCCTTCTCTTTTTCTGTGATGTGGACATCTACTTCACATCTGAATTCCTCAATACGTGTA 

GGCTGAATACACAGCCAGGGAAGAAGGTATTTTATCCAGTTCTTTTCAGTCAGTACAATCCTGGCATA 

ATATACGGCCACCATGATGCAGTCCCTCCCTTGGAACAGCAGCTGGTCATAAAGAAGGAAACTGGATT 

TTGGAGAGACTTTGGATTTGGGATGACGTGTCAGTATCGGTCAGACTTCATCAATATAGGTGGGTTTG 

ATCTGGACATCAAAGGCTGGGGCGGAGAGGATGTGCACCTTTATCGCAAGTATCTCCACAGCAACCTC 

ATAGTGGTACGGACGCCTGTGCGAGGACTCTTCCACCTCTGGCATGAGAAGCGCTGCATGGACGAGCT 

GACCCCCGAGCAGTACAAGATGTGCATGCAGTCCAAGGCCATGAACGAGGCATCCCACGGCCAGCTGG 

GCATGCTGGTGTTCAGGCACGAGATAGAGGCTCACCTTCGCAAACAGAAACAGAAGACAAGTAGCAAA 

AAAACA TGA ACTCCCAGAGAAGGATTGTGGGAGACACTTTTTCTTTCCTTTTGCAATTACTGAAAGTG 

GCTGCAACAGAGAAAAGACTTCCATAAAGGACGACAAAAGAATTGGACTGATGGGTCAGAGATGAGAA 

AGCCTCCGATTTCTCTCTGTTGGGCTTTTTACAACAGAAATCAAAATCTCCGCTTTGCCTGCAAAAGT 

AACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTGAGATTATAAGCCTAATGGTG 

TGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGAAATATTCATG 

ATTTAAGAGCAGTTTTGTAAAAAATTCATTAGCATGAAAGGCAAGCATATTTCTCCTCATATGAATGA 

GCCTATCAGCAGGGCTCTAGTTTCTAGGAATGCTAAAATATCAGAAGGCAGGAGAGGAGATAGGCTTA 

TTATGATACTAGTGAGTACATTAAGTAAAATAAAATGGACCAGAAAAGAAAAGAAACCATAAATATCG 

TGTCATATTTTCCCCAAGATTAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCT 

CCGTTTTTTTCTTTTATTTAAAAATGCACTTTTTTTCCCTTGTGAGTTATAGTCTGCTTATTTAATTA 

CCACTrrGCAAGCCTTACAAGAGAGCACAAGTTGGCCTACATTTTTATATTTTTTAAGAAGATACTTT 

GAGATGCATTATGAGAACTTTCAGTTCAAAGCATCAAATTGATGCCATATCCAAGGACATGCCAAATG 

CTGATTCTGTCAGGCACTGAATGTCAGGCATTGAGACATAGGGAAGGAATGGTTTGTACTAATACAGA 

CGTACAGATACTTTCTCTGAAGAGTATTTTCGAAGAGGAGCAACTGAACACTGGAGGAAAAGAAAATG 

ACACTTTCTGCTTTACAGAAAAGGAAACTCATTCAGACTGGTGATATCGTGATGTACCTAAAAGTCAG 

AAACCACATTTTCTCCTCAGAAGTAGGGACCGCTTTCTTACCTGTTTAAATAAACCAAAGTATACCGT 

GTGAACCAAACAATCTCTTTTCAAAACAGGGTGCTCCTCCtGGCTTCTGGCTTCCATAAGAAGAAATG 

GAGAAAAATATATATATATATATATATATTGTGAT^GATCAATCCATCTGCCAGAATCTAGTGGGATG 

GAAGTTTTTGCTACATGTTATCCACCCCAGGCCAGGTGGAAGTAACTGAATTATTTTTTAAATTAAGC 

AGTTCTACTCAATCACCAAGATGCTTCTGAAAATTGCATTTTATTACCATTTCAAACTATTTTTTAAA 

AATAAATACAGTTAACATAGAGTGGTTTCTTCATTCATGTGAAAATTATTAGCCAGCACCAGATGCAT 

GAGCTAATTATCTCTTTGAGTCCTTGCTTCTGTTTGCTCACAGTAAACrCATTGTTTAAAAGC^ 

GAACATTCAAGCTGTTGGTGTGTTAAAAAATGCATTGTATTGATTTGTACTGGTAGTTTATGAAATTT 

AATTAAAACACAGGCCATGAATGGAAGGTGGTATTGCACAGCTAATAAAATATGATTTGTGGATAT 
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MMMVRRGLLAWI SRVWLLVLLCCAI SVLYMLACTPKGDEEQLALPRANS PTGKEGYQAVLQ 
EWEEQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFL 
HSQVDKAEWAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIES 
ALETLNNPAENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPI 
MKVKNEKLNMANTL INVI VPLAKRVDKFRQFMQNFREMC I EQDGRVHLTWYFGKEE I NEVK 
GILENTSKAANFRNFTFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCR 
LNTQPGKKVFYPVLFSQYNPGI I YGHHDAVPPLEQQLVI KKETGFWRDFGFGMTCQYRSDFI 
N I GGFDLD I KGWGGEDVHLYRKYLHSNLI WRTPVRGLFHLWHEKRCMDELTPEQYKMCMQS 
KAMNEASHGQLGMLVFRHEIEAHLRKQKQKTSSKKT 
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FIGURE 277 

GAAAGAATGTTGTGGCTGCTCTTTTTTCTGGTGACTGCCATTCATGCT 

AGGTGCAGAAAATGCTTTTA/^GTGAGACTTAGTATCAGAACAGCTCTGGGAGATAAAGCAT 

ATGCCTGGGATACCAATGAAGAATACCTCTTCAAAGCGATGGTAGCTTTCTCCATGAGAAAA 

GTTCCCAACAGAGAAGCAACAGAAATTTCCCATGTCCTACTTTGCAATGTAACCCAGAGGGT 

ATCATTCTGGTTTGTGGTTACAGACCCTTCAAAAAATCACACCCTTCCTGCTGTTGAGGTGC 

AATCAGCCATAAGAATGAACAAGAACCGGATCAACAATGCCTTCTTTCTAAATGACCAAACT 

CTGGAATTTTTAAAAATCCCTTCCACACTTGCACCACCCATGGACCCATCTGTGCCCATCTG 

GATTATTATATTTGGTGTGATATTTTGCATCATCATAGTTGCAATTGCACTACTGATTTTAT 

CAGGGATCTGGCAACGTAGAAGAAAGAACAAAGAACCATCTGAAGTGGATGACGCTGAAGAT 

AAGTGTGAAAACATGATCACAATTGAAAATGGCATCCCCTCTGATCCCCTGGACATGAAGGG 

GGGCATATTAATGATGCCTTCATGACAGAGGATGAGAGGCTCACCCCTCTCTGAAGGGCTGT 

TGTTCTGCTTCCTCAAGAAATTAAACATTTGTTTCTGTGTGACTGCTGAGCATCCTGAT^TA 

CCAAGAGCAGATCATATATTTTGTTTCACCATTCTTCTTTTGTAATAAATTTTGAATGTGCT 

TGAAAGTGAAAAGCAATCAATTATACCCACCAACACCACTGAAATCATAAGCTATTCACGAC 

TCAAAATATTCTAAAATATTTTTCTGACAGTATAGTGTATAAATGTGGTCATGTGGTATTTG 

TAGTTATTGATTTAAGCATTTTTAGAAATAAGATCAGGCATATGTATATATTTTCACACTTC 

AAAGACCTAAGGAAT^TAAATTTTCCAGTGGAGAATACATATAATATGGTGTAGAAATCAT 

TGAAAATGGATCCTTTTTGACGATCACTTATATCACTCTGTATATGACTAAGTAAACAAAAG 

TGAGAAGTAATTATTGTAAATGGATGGATAAAAATGGAATTACTCATATACAGGGTGGAATT 

TTATCCTGTTATCACACCAACAGTTGATTATATATTTTCTGAATATCAGCCCCTAATAGGAC 

AATTCTATTTGTTGACCATTTCTACAATTTGTAAAAGTCCAATCTGTGCTAACTTAATAAAG 

TAATAATCATCTCTTTTTAAAAAAAAAA/^AAAAAAAAAAAAAAA 
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FIGURE 278 

MLWLLFFLVTAIHAELCQPGAENAFKVRLSIRTALGDKAYAWDTNEEYLFKAMVAFSMRKVP 
NREATEISHVLLClSnn^QRVSFWFVVTDPSKiraTLPAVEVQSAIR^ 

FLKIPSTLAPPMDPSVPIWIIIFGVIFCIIIVAIALLILSGIWQRRRKNKEPSEVDDAEDKC 
ENMITIENGIPSDPLDMKGGILMMPS 
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FIGURE 279 

AACTCAAACTCCTCTCTCTGGGAAAACGCGGTGCTTGCTCCTCCCGGAGTGGCCTTGGCAGG 
GTGTTGGAGCCCTCGGTCTGCCCCGTCCGGTCTCTGGGGCCAAGGCTGGGTTTCCCTCATGT 
ATGGCAAGAGCTCTACTCGTGCGGTGCTTCTTCTCCTTGGCATACAGCTCACAGCTCTTTGG 
CCTATAGCAGCTGTGGAAATTTATACCTCCCGGGTGCTGGAGGCTGTTAATGGGACAGATGC 
TCGGTTAAAATGCACTTTCTCCAGCTTTGCCCCTGTGGGTGATGCTCTAACAGTGACCTGGA 
ATTTTCGTCCTCTAGACGGGGGACCTGAGCAGTTTGTATTCTACTACCACATAGATCCCTTC 
CAACCCATGAGTGGGCGGTTTAAGGACCGGGTGTCTTGGGATGGGAATCCTGAGCGGTACGA 
TGCCTCCATCCTTCTCTGGAAACTGCAGTTCGACGACAATGGGACATACACCTGCCAGGTGA 
AGAACCCACCTGATGTTGATGGGGTGATAGGGGAGATCCGGCTCAGCGTCGTGCACACTGTA 
CGCTTCTCTGAGATCCACTTCCTGGCTCTGGCCATTGGCTCTGCCTGTGCACTGATGATCAT 
AATAGTAATTGTAGTGGTCCTCTTCCAGCATTACCGGAAAAAGCGATGGGCCGAAAGAGCTC 
ATAAAGTGGTGGAGATAAAATCAAAAGAAGAGGAAAGGCTCAACCAAGAGAAAAAGGTCTCT 
GTTTATTTAGAAGACACAGACTAACAATTTTAGATGGAAGCTGAGATGATTTCCAAGAACAA 
GAACCCTAGTATTTCTTGAAGTTAATGGAAACTTTTCTTTGGCTTTTCCAGTTGTGACCCGT 
TTTCCAACCAGTTCTGCAGCATATTAGATTCTAGACAAGCAACACCCCTCTGGAGCCAGCAC 
AGTGCTCCTCCATATCACCAGTCATACACAGCCTCATTATTAAGGTCTTATTTAATTTCAGA 
GTGTAAATTTTTTCAAGTGCTCATTAGGTTTTATAAACAAGAAGCTACATTTTTGCCCTTAA 
GACACTACTTACAGTGTTATGACTTGTATACACATATATTGGTATCAAAGGGGATAAAAGCC 
AATTTGTCTGTTACATTTCCTTTCACGTATTTCTTTTAGCAGCACTTCTGCTACTAAAGTTA 
ATGTGTTTACTCTCTTTCCTTCCCACATTCTCAATTAAAAGGTGAGCTAAGCCTCCTCGGTG 
TTTCTGATTAACAGTAAATCCTAAATTCAAACTGTTAAATGACATTTTTATTTTTATGTCTC 
TCCTTAACTATGAGACACATCTTGTTTTACTGAATTTCTTTCAATATTCCAGGTGATAGATT 
TTTGTCG 
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FIGURE 280 

MYGKSSTRAVLLLLG IQLTALWP I AAVE I YTSRVLEAVNGTDARLKCTFSSFAPVGDALTVT 
WNFRPLDGGPEQFVFYYHIDPFQPMSGRFKDRVSWDGNPERYDASILLWKLQFDDNGTYTCQ 
VKNPPDVDGVIGEIRLSWHTVRFSEIHFLALAIGSACALMI 1 1 VI VWLFQHYRKKRWAER 
AHKWEIKSKEEERLNQEKKVSVYLEDTD 
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FIGURE 281 
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GCATTTTTGTCTGTGCTCCCTGATCTTCAGGTCACCACCATGAAGTTCTTAGCAGTCCTGGT 

ACTCTTGGGAGTTTCCATCTTTCTGGTCTCTGCCCAGAATCCGACAACAGCTGCTCCAGCTG 

ACACGTATCCAGCTACTGGTCCTGCTGATGATGAAGCCCCTGATGCTGAAACCACTGCTGCT 

GCAACCACTGCGACCACTGCTGCTCCTACCACTGCAACCACCGCTGCTTCTACCACTGCTCG 

TAAAGACATTCCAGTTTTACCCAAATGGGTTGGGGATCTCCCGAATGGTAGAGTGTGTCCCT 

GAGATGGAATCAGCTTGAGTCTTCTGCAATTGGTCACAACTATTCATGCTTCCTGTGATT^ 

ATCCAACTACTTACCTTGCCTACGATATCCCCTTTATCTCTAATCAGTTTATTTTCTTTCAA 

ATAAAAAATAACTATGAGCAACATAAAAAAAAAAAAA 
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FIGURE 282 



MKFLAVLVLLGVS I FLVSAQNPTTAAPADTYPATGP7UDDEAPDAETTAAATTATTAAPTTAT 
TAASTTARKDI PVLPKWVGDLPNGRVCP 
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FIGURE 283 

GGACTCTGAAGGTCCCAAGCAGCTGCTGAGGCCCCCAAGGAAGTGGTTCCAACCTTGGACCC 
CTAGGGGTCTGGATTTGCTGGTTAACAAGATAACCTGAGGGCAGGACCCCATAGGGGAATGC 
TACCTCCTGCCCTTCCACCTGCCCTGGTGTTCACGGTGGCCTGGTCCCTCCTTGCCGAGAGA 
GTGTCCTGGGTCAGGGACGCAGAGGACGCTCACAGACTCCAGCCCTTTGTTACCGAGAGGAC 
ACTTGGCAAGGTCCAGCGATGGTCCGGAGTCCACACACAGACTGGCGGCAGGGCAGGAGGGG 
GACAGTTCTGTTGTGCTTGGTTGGACAGTAAGAGGGTCTTGGCCAGTCCAGGGTGGGGGGCG 
GCAAACTCCATAAAGAACCAGAGGGTCTGGGCCCCGGCCACAGAGTCATCTGCCCAGCTCCT 
CTGCTGCTGGCCAGTGGGAGTGGCACGAGGTGGGGCTTTGTGCCAGTAAAACCACAGGCTGG 
ATTTGCCTGCGGGCCATGGTCCCTGTCTAGGGCAGCAATTCTCAACCTTCTTGCTCTCAGGA 
CCCCAAAGAGCTTTCATTGTATCTATTGATTTTTACCACATTAGCAATTAAAACTGAGAAAT 
GGGCCGGGCACGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGAT 
CACCTGAGATCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCTTGTCTACTAAAAA 
TACAAAAAATTAGCCAGGCACAGTGGTGTGCACTGGTAGTCCCAGTTACTCGGGAGGCTGAG 
GCAGGAAAATCGCTTGAACCCAGGAGGCGGACGTTGCGGTGAGCCGAGATCGCGCCGCTGAT 
TCCAGCCTGGGCGACAAGAGTGAGACTCCATCTCACACA 
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FIGURE 2X4 

MLPPALPPALVFTVAWSLLAERVSWVRDAEDAHRLQPFVTERTLGKVQRWSGVHTQTGGRAG 
GGQFCCAWLDSKRVLASPGWGAANSIKNQRVWAPATESSAQLLCCWPVGVARGGALCQ 
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FIGURE 
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GTCAISCCAGTGCCTGCTCTGTGCCTGCTCTGGGCCCTGGCAATGGTGACCCXKSCCTGCCTCA 

GCGGCCCCCATGGGCGGCCCAGAACTGGCACAGCATGAGGAGCTGACCCTGCTCTTCCATGG 

GACCCTGCAGCTGGGCCAGGCCCTCAACGGTGTGTACAGGACCACGGAGGGACGGCTGACAA 

AGGCCAGGAACAGCCTGGGTCTCTATGGCCGCACAATAGAACTCCTGGGGCAGGAGGTCAGC 

CGGGGCCGGGATGCAGCCCAGGAACTTCGGGCAAGCCTGTTGGAGACTCAGATGGAGGAGGA 

TATTCTGCAGCTGCAGGCAGAGGCCACAGCTGAGGTGCTGGGGGAGGTGGCCCAGGCACAGA 

AGGTGCTACGGGACAGCGTGCAGCGGCTAGAAGTCCAGCTGAGGAGCGCCTGGCTGGGCCCT 

GCCTACCGAGAATTTGAGGTCTTAAAGGCTCACGCTGACAAGCAGAGCCACATCCTATGGGC 

CCTCACAGGCCACGTGCAGCGGCAGAGGCGGGAGATGGTGGCACAGCAGCATCGGCTGCGAC 

AGATCCAGGAGAGACTCCACACAGCGGCGCTCCCAGCCTCAATCTGCCTGGATGGAACTGAG 

GACCAATCATGCTGCAAGGAACACTTCCACGCCCCGTGAGGCCCCTGTGCAGGGAGGAGCTG 

CCTGTTCACTGGGATCAGCCAGGGCGCCGGGCCCCACTTCTGAGCACAGAGCAGAGACAGAC 

GCAGGCGGGGACAAAGGCAGAGGATGTAGCCCCATTGGGGAGGGGTGGAGGAAGGACATGTA 

CCCTTTCATGCCTACACACCCCTCATTAAAGCAGAGTCGTGGCATTTCAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 2S6 

MPVPALCLLWALAMVTRPASAAPMGGPELAQHEELTLLFHGTLQLGQALNGVYRTTEGRLTK 

ARNSLGLYGRTIELLGQEVSRGRDAAQELRASLLETQMEEDILQLQAEATAEVLGEVAQAQK 

VLRDSVQRLEVQLRSAWLGPAYREFEVLKAHADKQSHILWALTGHVQRQRREMVAQQHRLRQ 
IQERLHTAALPA 
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FIGURE 2S7 

GGCAAC&ISGCTCAGCAGGCTTGCCCCAGAGCCATGGCAAAGAATGGACTTGTAATTTGCAT 

CCTGGTGATCACCTTACTCCTGGACCAGACCACCAGCCACACATCCAGATTAAAAGrc^^ 

AGCACAGCAAACGTCGAGTGAGAGACAAGGATGGAGATCTGAAGACTCAAATTGAAAAGCTC 

TGGACAGAAGTCAATGCCTTGAAGGAAATTCAAGCCCTGCAGACAGTCTGTCTCCGAGGCAC 

TAAAGTTCACAAGAAATGCTACCTTGCTTCAGAAGGTTTGAAGCATTTCCATGAGGCCAATG 

AAGACTGCATTTCCAAAGGAGGAATCCTGGTTATCCCCAGGAACTCCGACGAAATCAACGCC 

CTCCAAGACTATGGTAAAAGGAGCCTGCCAGGTGTCAATGACTTTTGGCTGGGCATCAATGA 

CATGGTCACGGAAGGCAAGTTTGTTGACGTCAACGGAATCGCTATCTCCTTCCTCAACTGGG 

ACCGTGCACAGCCTAACGGTGGCAAGCGAGAAAACTGTGTCCTGTTCTCCCAATCAGCTCAG 

GGCAAGTGGAGTGATGAGGCCTGTCGCAGCAGCAAGAGATACATATGCGAGTTCACCATCCC 

TAAATaSGTCTTTCTCCAATGTGTCCTCCAAGCAAGATTCATCATAACTTATAGGTTCATGA 

TCTCTAAGATCAAGTAAAAATCATAATTTTTACTTATTAAAAAATTGCAACACAAGATCAAT 

GTCCATAGCAATATGATAGCATCAGCCAATTTTGCTAACACATTTCTTTGGGATTTTGCCCT 

TCCTGGGGTATAGGGGATCAGAAATATTGATCCATGTGCACGCAGATAAAATGGCTTCTGCT 

AAACAGACTAAAATCTTTCTCTCTAGTCTTTCTCACTTGTACAAACCCAGTTTGTTTTCAAA 

AAATCACAGTAGCAATGCAACTCATCACTCTAGAAAAGCAAGCTTAGGCTACCTGAAAGATT 

TTCCCTTGGAAGTTTAGCGTATGTTTGACTAACAAAAATTCCCTACATCAGAGACTCTAGGT 

GCTATATAATCCAAAAACTTTTCAGCCTGTTGCTCATTCTGTCCCATGCTGGCAATAATACC 

TTGTCAGCCCATTACCCTTATTTTGAATTGCTCCATCTCCTGGTGGGACTTGTATCTTGTCT 

GCCATATCAGAACACAAACCCCTGAAGAGGTTCTGATTTGATTTTTTTTTTTTCTTCATGCC 

TACCCTTTTTTTGGAAGTTTCCAGCCGCAATTTGAAATGAAATGACAAGGTGTATATTTGAT 

CAATTTTCATTCCCACCATTGCATTACAACCTCTAACTTAAATGGGTAACCCTAAGGCATAT 

CAAAGAAGCAGATTGCATGATAAACGGAAATAGAAAAAAAGAACCTACATTTATTTTGCTTT 

AGCATCCTTACTCTCACCTTTTATGAGATTGAGAGTGGACTTACATTTCCTTTTTTACATTT 

TCGTATATTTATTTTTTTTAGCCATCATTATATGTTTAAGTCTATTATGGGCAACCAATCTT 

TGGAAGCTGAAAACTGAATTTAAAGAATGCTATCTTGGAAAATTGCATACGTCTGTGCAATT 

TTTTATTCTGCCTAGTGCTATTCTGCTTGTTTAACTAGATTGTACAAAATAACTTCATTGCT 

TAATATCAAATTACAAAGTTTAGACTTGGAGGGAAATGGGCTTTTTAGAAGCAAACAATTTT 

AAATATATTTTGTTCTTCAAATAAATAGTGTTTAAACATTGAATGTGTTTTGTGAACAATAT 

CCCACTTTGCAAACTTTAACTACACATGCTTGGAATTAAGTTTTAGCTGTTTTCATTGCTCA 

ATAATAAAGCCTGAATTCTGATCAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 288 

MAQQACPRAMAKNGLVICILVITLLLDQTTSHTSRLKARKHSKRRVRDKDGDLKTQIEKLWT 
EVNALKEIQALQTVCLRGTKVHKKCYLASEGLKHFHEANEDCISKGGILVIPRNSDEINALQ 
DYGKRSLPGVNDFWLGINDMVTEGKFVDVNGIAISFLNWDRAQPNGGKRENCVLFSQSAQGK 
WSDEACRSSKRYICEFTIPK 
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FIGURE 289 

GCGAGGACCGGGTATAAGAAGCCTCGTGGCCTTGCCCGGGCAGCCGCAGGTTCCCCGCGCGC 

CCCGAGCCCCCGCGCCATgAAGCTCGCCGCCCTCCTGGGGCTCTGCGTGGCCCTGTCCTGCA 

GCTCCGCTGCTGCTTTCTTAGTGGGCTCGGCCAAGCCTGTGGCCCAGCCTGTCGCTGCGCTG 

GAGTCGGCGGCGGAGGCCGGGGCCGGGACCCTGGCCAACCCCCTCGGCACCCTCAACCCGCT 

GAAGCTCCTGCTGAGCAGCCTGGGCATCCCCGTGAACCACCTCATAGAGGGCTCCCAGAAGT 

GTGTGGCTGAGCTGGGTCCCCAGGCCGTGGGGGCCGTGAAGGCCCTGAAGGCCCTGCTGGGG 

GCCCTGACAGTGTTTGGCTGAGCCGAGACTGGAGCATCTACACCTGAGGACAAGACGCTGCC 

CACCCGCGAGGGCTGAAAACCCCGCCGCGGGGAGGACCGTCCATCCCCTTCCCCCGGCCCCT 

CTCAATAAACGTGGTTAAGAGCAATUy^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA;^^ 

AAAAAAAAT^AAA 
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FIGURE 290 

MKLAALLGLCVALSCSSAAAFLVGSAKPVAQPVAALESAAEAGAGTLANPLGTLNPLKLLLS 
SLGI PVNHLIEGSQKCVAELGPQAVGAVKALKALLGALTVFG 
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FIGURE 291 

TGAAGGACTTTTCCAGGACCCAAGGCCACACACTGGAAGTCTTGCAGCTGAAGGGAGGCACT 

CCTTGGCCTCCGCAGCCGATCACATGAAGGTGGTGCCAAGTCTCCTGCTCTCCGTCCTCCTG 

GCACAGGTGTGGCTGGTACCCGGCTTGGCCCCCAGTCCTCAGTCGCCAGAGACCCCAGCCCC 

TCAGAACCAGACCAGCAGGGTAGTGCAGGCTCCCAGGGAGGAAGAGGAAGATGAGCAGGAGG 

CCAGCGAGGAGAAGGCCGGTGAGGAAGAGAAAGCCTGGCTGATGGCCAGCAGGCAGCAGCTT 

GCCAAGGAGACTTCAAACTTCGGATTCAGCCTGCTGCGAAAGATCTCCATGAGGCACGATGG 

CAACATGGTCTTCTCTCCATTTGGCATGTCCTTGGCCATGACAGGCTTGATGCTGGGGGCCA 

CAGGGCCGACTGAAACCCAGATCAAGAGAGGGCTCCACTTGCAGGCCCTGAAGCCCACCAAG 

CCCGGGCTCCTGCCTTCCCTCTTTAAGGGACTCAGAGAGACCCTCTCCCGCAACCTGGAACT 

GGGCCTCTCACAGGGGAGTTTTGCCTTCATCCACAAGGATTTTGATGTCAAAGAGACTTTCT 

TCAATTTATCCAAGAGGTATTTTGATACAGAGTGCGTGCCTATGAATTTTCGCAATGCCTCA 

CAGGCCAAAAGGCTCATGAATCATTACATTAACAAAGAGACTCGGGGGAAAATTCCCAAACT 

GTTTGATGAGATTAATCCTGAAACCAAATTAATTCTTGTGGATTACATCTTGTTCAAAGGGA 

AATGGTTGACCCCATTTGACCCTGTCTTCACCGAAGTCGACACTTTCCACCTGGACAAGTAC 

AAGACCATTAAGGTGCCCATGATGTACGGTGCAGGCAAGTTTGCCTCCACCTTTGACAAGAA 

TTTTCGTTGTCATGTCCTCAAACTGCCCTACCAAGGAAATGCCACCATGCTGGTGGTCCTCA 

TGGAGAAAATGGGTGACCACCTCGCCCTTGAAGACTACCTGACCACAGACTTGGTGGAGACA 

TGGCTCAGAAACATGAAAACCAGAAACATGGAAGTTTTCTTTCCGAAGTTCAAGCTAGATCA 

GAAGTATGAGATGCATGAGCTGCTTAGGCAGATGGGAATCAGAAGAATCTTCTCACCCTTTG 

CTGACCTTAGTGAACTCTCAGCTACTGGAAGAAATCTCCAAGTATCCAGGGTTTTACGAAGA 

ACAGTGATTGAAGTTGATGAAAGGGGCACTGAGGCAGTGGCAGGAATCTTGTCAGAAATTAC 

TGCTTATTCCATGCCTCCTGTCATCAAAGTGGACCGGCCATTTCATTTCATGATCTATGAAG 

AAACCTCTGGAATGCTTCTGTTTCTGGGCAGGGTGGTGAATCCGACTCTCCTATAATTCAGG 

ACATGCATAAGCACTTCGTGCTGTAGTAGATGCTGAATCTGAGGTATCAAACACACACAGGA 

TACCAGCAATGGATGGCAGGGGAGAGTGTTCCTTTTGTTCTTAACTAGTTTAGGGTGTTCTC 

AAATAAATACAGTAGTCCCCACTTATCTGAGGGGGATACATTCAAAGACCCCCAGCAGATGC 

CTGAAACGGTGGACAGTGCTGAACCTTATATATATTTTTTCCTACACATACATACCTATGAT 

AAAGTTTAATTTATAAATTAGGCACAGTAAGAGATTAACAATAATAACAACATTAAGTAAAA 

TGAGTTACTTGAACGCAAGCACTGCAATACCATAACAGTCAAACTGATTATAGAGAAGGCTA 

CTAAGTGACTCATGGGCGAGGAGCATAGACAGTGTGGAGACATTGGGCAAGGGGAGAATTCA 

CATCCTGGGTGGGACAGAGCAGGACGATGCAAGATTCCATCCCACTACTCAGAATGGCATGC 

TGCTTAAGACTTTTAGATTGTTTATTTCTGGAATTTTTCATTTAATGTTTTTGGACCATGGT 

TGACCATGGTTAACTGAGACTGCAGAAAGCAAAACCATGGATAAGGGAGGACTACTACAAAA 

GCATTAAATTGATACATATTTTTTAAAAAAAAAAAAAAAAAAA 
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FIGURE 292 



MKWPSLLLS VLLAQVWLVPGLAPS PQS PETPAPQNQTSR WQAPREEEEDEQEASEEKAGE 

EEKAWLMASRQQUVKETSNFGFSLLRKISMRHDGNIWFSPFGMSLAMT 

KRGLHLQALKPTKPGLLPSLFKGLRETLSRNLELGLSQGSFAFIHKDFDVKETFFNLSKRYF 

DTECVPMNFRNASQAKRLMNHYINKETRGKIPKLFDEINPETKLILVDYILFKGKWLTPFDP 

VFTEVDTFHLDKYKTIKVPMMYGAGKFASTFDKNFRCHVLKLPYQGNATMLVVLMEKM 

ALEDYLTTDLVETWLRNMKTRNMEVFFPKFKIjDQKYEMHELLRQMGIRRIFSPFADLSELSA 

TGRNLQVSRVLRRTVIEVDERGTEAVAGILSEITAYSMPPVIKVDRPFHFMIYEETSGMLLF 

LGRWNPTLL 
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FIGURE 293 

CTGGGATCAGCCACTGCAGCTCCCTGAGCACTCrCTACAGAGACGCGGACCCCAGACATG 

GAGGCTCCTCCTGGTCACCAGCCTGGTGGTTGTGCTGCTGTGGGAGGCAGGTGCAGTC^ 

CACCCAAGGTCCCTATCAAGATGCT^GTCAAACACTGGCCCTCAGAGCAGGACCC^ 

GCCTGGGGCGCCCGTGTGGTGGAGCCTCCGGAGAAGGACGACCAGCTGGTGGTGCTGTTCCC 

TGTCCAGAAGCCGAAACTCTTGACCACCGAGGAGAAGCCACGAGGTCAGGGCAGGGGCCCCA 

TCCTTCCAGGCACCAAGGCCTGGATGGAGACCGAGGACACCCTGGGCCGTGTCCTGAGTCCC 

GAGCCCGACCATGACAGCCTGTACCACCCTCCGCCTGAGGAGGACCAGGGCGAGGAGAGGCC 

CCGGTTGTGGGTGATGCCAAATCACCAGGTGCTCCTGGGACCGGAGGAAGACCAAGACCACA 

TCTACCACCCCCAGTAGGGCTCCAGGGGCCATCACTGCCCCCGCCCTGTCCCAAGGCCCAGG 

CTGTTGGGACTGGGACCCTCCCTACCCTGCCCCAGCTAGACAAATAAACCCCAGCAGGCAAA 

AAAAAAAAAAAAAAAA 
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FIGURE 294 

MRRLLLVTSLVVVLLWEAGAVPAPKVPIKMQVKHWPSEQDPEKAWGARVVEPPEKDDQLVVL 
FPVQKPKLLTTEEKPRGQGRGPILPGTKAWMETEDTLGRVLSPEPDHDSLYHPPPEEDQGEE 
RPRLWVMPNHQVLLGPEEDQDHIYHPQ 
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FIGURE 295 

AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAG6AGCTCTCTG 
TACCCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAASSAACCAACTCAGCTTCCTGC 
TGTTTCTCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGAA 
TGGACCTGTTCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCC 
TAGTGCATTTGATGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCT 
GTGACATGACCTCTGGGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATG 
CGTGGGAAGTGCACGGTGGGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCC 
AGAGGGGGACGGCAACTGGGCCAACTACAACACCTTTGGATCTGCAGAGGCGGCCACGAGCG 
ATGACTACAAGAACCCTGGCTACTACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTG 
CCCAATAAGTCCCCCATGCAGCACTGGAGAAACAGCTCCCTGCTGAGGTACCGCACGGACAC 
TGGCTTCCTCCAGACACTGGGACATAATCTGTTTGGCATCTACCAGAAATATCCAGTGAAAT 
ATGGAGAAGGAAAGTGTTGGACTGACAACGGCCCGGTGATCCCTGTGGTCTATGATTTTGGC 
GACGCCCAGAA/U^CAGCATCTTATTACTCACCCTATGGCCAGCGGGAATTCACTGCGGGATT 
TGTTCAGTTCAGGGTATTTAATAACGAGAGAGCAGCCAACGCCTTGTGTGCTGGAATGAGGG 
TCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAGGATACTTTCCAGAGGCCAGT 
CCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATATGGAACTCATGTTGGTTA 
CAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCGTTGAGAGTTTTGTG 
GGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAACAACTTACCCA 
GTAGCTAGAATGTTAATGGCAGAAGAGAAAACAATAAATCATATTGACTCAAGAAAAAAA 
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FIGURE 296 

MNQLSFLLFLIATTRGWSTDEAimTKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTEN 
GVIYQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFG 
SAEAATSDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGI 
YQKYPVKYGEGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAAN 
ALCAGMRWGCNTEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITE 
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FIGURE 297 

GCGGAGCCGGCGCCGGCTGCGCAGAGGAGCCGCTCTCGCCGCCGCCACCTCGGCTGGGAGCC 

CACGAGGCTGCCGCATCCTGCCCTCGGAACAMSGGACTCGGCGCGCGAGGTGCTTGGGCCG 

CGCTGCTCCTGGGGACGCTGCAGGTGCTAGCGCTGCTGGGGGCCGCCCATGAAAGCGCAGCC 

ATGGCGGCATCTGCAAACATAGAGAATTCTGGGCTTCCACACAACTCa^GTGCTAA 

AGAGACTCTCCAA(:».TGTGCCTTCTGACCATACAAATGAAACTTCCAACA 

CACCAACTTCAGTTGCCTCAGACrCCAGTAATACAACGGTCACCACCATGAAACCTACAGC^ 

GCATCTAATACAACAACACCAGGGATGGTCTCAACAAATATGACTTCrACCACCT 

TACACCCAAAACAACAAGTGTTTCACAGAACACATCTCAGATATCAACATCCACAA^^ 

TAACCCACAATAGTTCAGTGACATCTGCTGCTTCATCAGTAACAATCACAACAACTATGCAT 

TCTGAAGCAAAGAAAGGATCAAAATTTGATACTGGGAGCTTTGTTGGTGGTATTGTATTAAC 

GCTGGGAGTTTTATCTATTCTTTACATTGGATGCAAAATGTATTACTCAAGAAGAGGCATTC 

GGTATCGAACCATAGATGAACATGATGCCATCATTTAAGGAAATCCATGGACCAAGGATGGA 

ATACAGATTGATGCTGCCCTATCAATTAATTTTGGTTTATTAATAGTTTAAAACAATATTCT 

CTTTTTGAAAATAGTATAAACAGGCCATGCATATAATGTACAGTGTATTACGTAAATATGTA 

AAGATTCTTCAAGGTAACAAGGGTTTGGGTTTTGAAATAAACATCTGGATCTTATAGACCGT 

TCATACAATGGTTTTAGCAAGTTCATAGTAAGACAAACAAGTCCTATCTTTTTTTTTTGGCT 

GGGGTGGGGGCATTGGTCACATATGACCAGTAATTGAAAGACGTCATCACTGAAAGACAGAA 

TGCCATCTGGGCATACAAATAAGAAGTTTGTCACAGCACTCAGGATTTTGGGTATCTTTTGT 

AGCTCACATAAAGAACTTCAGTGCTTTTCAGAGCTGGATATATCTTAATTACTAATGCCAC^ 

CAGAAATTATACAATCAAACTAGATCTGAAGCATAATTTAAGAAAAACATCAACATT 

TGCTTTAAACTGTAGTAGTTGGTCTAGAAACAAAATACTCC 
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FIGURE 298 

MGLGARGAWAAIililiGTI^VLALLGAAHESAAMAASANIENS^ 

TNETSNSTVKPPTSVASDSSNTTVTTMKPTAASNTTTPOflVSTNMTSTTLKSTPKTTSVSQN 
TSQISTSTMTVTHNSSVTSAASSVriTTTMHSEAKKGSKFDTGSFVGGIVLTLGVLSILYIG 
CKMYYSRRGIRYRTIDEHDAI I 
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FIGURE 299 

CAGCCGGGTCCCAAGCCTGTGCCTGAGCCTGAGCCTGAGCCTGAGCCCGAGCCGGGAGCCGG 
TCGCGGGGGCTCCGGGCrGTGGGACCGCTGGGCCCCCAGCGATGGCGACCCTGTGGOGAnnr 
CTTCTTCGGCITGGCTCCITGCTCAGCCTGTCGTGCCTGGCGCTTO 

GCAGCTGTCAGACGCCGCCAAGAATTTCGAGGATGTCAGATGTAAATGTATCTGCCCrCCCr 

ATAAAGAAAATTCTGGGCATATTTATAATAAGAACATATCTCAGAAAGATTGTGATTGCCTT 

CATGTTGTGGAGCCCATGCCTGTGCGGGGGCCTGATGTAGAAGCATACTGTCTACGCTGTGA 

ATGCAAATATGAAGAAAGAAGCTCTGTCACAATCAAGGTTACCATTATAATTTATCTCTCCA 

TTTTGGGCCTTCTACTTCTGTACATGGTATATCTTACTCTGGTTGAGCCCATACTGAAGAGG 

CGCCTCTTTGGACATGCACAGTTGATACAGAGTGATGATGATATTGGGGATCACCAGCCTTT 

TGCAAATGCACACGATGTGCTAGCCCGCTCCCGCAGTCGAGCCAACGTGCTGAACAAGGTAG 

AATATGCACAGCAGCGCTGGAAGCTTCAAGTCCAAGAGCAGCGAAAGTCTGTCTTT 

CATGTTGTCCTCAGCTAATTGGGAATTGAATTCAAGGTGACTAGAAAGAAACAGGCAGA 

OTGGAAAGAACTGACTGGGTTTTGCTGGGTTTCATTTTAATACCTTGTTGATTTCACCAACT 

GTTGCTGGAAGATTCAAAACTGGAAGCAAAAACTTGCTTGATTTTTTTTTCTTGTTAACGTA 

ATAATAGAGACATTTTTAAAAGCACACAGCTCAAAGTCAGCCAATAAGTCTTTTCCTATTTG 

TGACTTTTACTAATAAAAATAAATCTGCCTGTAAATTATCTTGAAGTCCTTTACCTGGAACA 

AGCACTCTCTTTTTCACCACATAGTTTTAACTTGACTTTCAAGATAATTTTCAGGGTTTTTG 

TTGTTGTTGTTTTTTGTTTGTTTGTTTTGGTGGGAGAGGGGAGGGATGCCTGGGAAGTGGTT 

AACAACTTTTTTCAAGTCACTTTACTAAACAAACTTTTGTAAATAGACCTTACCTTCTATTT 

TCGAGTTTCATTTATATTTTGCAGTGTAGCCAGCCTCATCAAAGAGCTGACITACTCA 

ACTTTTGCACTGACTGTATTATCTGGGTATCTGCTGTGTCTGCACTTCATGGTAAACGGGAT 

CTAAAATGCCTGGTGGCTTTTCACAAAAAGCAGATTTTCTTCATGTACTGTGATGTCTGATG 

CAATGCATCCTAGAACAAACTGGCCATTTGCTAGTTTACTCTAAAGACTAAACATAGTCTTG 

GTGTGTGTGGTCTTACTCATCTTCTAGTACCTTTAAGGACAAATCCTAAGGACTTGGACACT 

TGCAATAAAGAAATTTTATTTTAAACCCAAGCCTCCCTGGATTGATAATATATACACATTTG 

TCAGCATTTCCGGTCGTGGTGAGAGGCAGCTGTTTGAGCTCCAATATGTGCAGCTTTGAACT 

AGGGCTGGGGTTGTGGGTGCCTCTTCTGAAAGGTCTAACCATTATTGGATAACTGGCTTTTT 

TCTTCCTATGTCCTCTTTGGAATGTAACAATAAAAATAATTTTTGAAACATC^ 
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FIGURE 300 

MATLWGGLUtLGSLLSLSCIJaiSVLLLAQLSDAAKNFEDVRCKCICPPYKE^ 
QKDCaDCIiHVVEPMPVRGPDVEAYCLRCECKYEERSSVTIKVTI 1 1 YLS ILGLLLLYMVYLTL 
VEPILKRRLFGHAQLIQSDDDIGDHQPFANAHDVLARSRSRANVLNKVEYAQQRWKLQVQEQ 
RKSVFDRHWIS 
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FIGURE 301 

GCACCTGCGACCACCGTGAGCAGT CATGG CGTACTCCACAGTGCAGAGAGTCGCTCTGGCTT 
CTGGGCTTGTCCTGGCTCTGTCGCrGCTGCTGCCCAAGGCCTTCCT 

CAGGAGCCGCCXSCCGACACCTGAAGGAAAATTOGGCCGATTTCCACCTATGATGCATCATO^ 

CCAGGCACCCTCAGATGGCCAGACTCCTGGGGCTCGTTTCCAGAGGTCTCACCTTGCCGAGG 

CATTTGCAAAGGCCT^GGATCAGGTGGAGGTGCTGGAGGAGGAGGTAGTGGAAGAGGTCTG 

ATGGGGCAGATTATTCCAATCTACGGTTTTGGGATTTTTTTATATATACTGTAC^ 

TAAGGTAAGTAGAATCATCCTAATCATATTACATCA ATGAA AATCTAATATGGCGATAAAAA 

TCATTGTCTACATTAAAACrTCTTATAGTT(ai.TAAAATTATTTCAAATCCATC^ 

AATCCTGCCTCCTCTTCATGAGGTACTTAGGATAGCCATTATTTCAGTTTCACATAAGAATG 

TTTACTCTUVTGTTTAAGTGTTTTGCCCCAAAATTCACAACTAACAAGGCAGAAC^^ 

GAACATGGATCTTTTGGTTCTTAATCCAGTQAGTGATACAATTCAATGCACTCCCCrrGCC^ 
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FIGURE 302 

MAYSTVQRViUJ^GLVLALSUjLPKAFLSRGKRQEPPPTPEGKLGRFPPMMHHHQAPSDGQT 

PGARFQRSHLAEAFAKAKGSGGGAGGGGSGRGLMGQIIPIYGFGIFLYILYILFKVSRIILI 
ILHQ 
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FIGURE 303 

CGGCTCGAGTGCAGCTGTGGGGAGATTTCAGTGCATTGCCTCCCCTGGGTGCTCTTCATCTT 

GGATTTGAAAGTTGAGAGCAGCATGTTTTGCCCACTGAAACTCATCCTGCTGCCAGTGTTAC 

TGGATTATTCCTTGGGCCTGAATGACTTGAATGTTTCCCCGCCTGAGCTAACAGTCCATGTG 

GGTGATTCAGCTCTGATGGGATGTGTTTTCCAGAGCACAGAAGACAAATGTATATTCAAGAT 

AGACTGGACTCTGTCACCAGGAGAGCACGCCAAGGACGAATATGTGCTATACTATTACTCCA 

ATCTCAGTGTGCCTATTGGGCGCTTCCAGAACCGCGTACACTTGATGGGGGACATCTTATGC 

AATGATGGCTCTCTCCTGCTCCAAGATGTGCAAGAGGCTGACCAGGGAACCTATATCTGTGA 

AATCCGCCTCAAAGGGGAGAGCCAGGTGITCAAGAAGGCGGTGGTACTGCATGTGCT^ 

AGGAGCCCAAAGAGCTCATGGTCCATGTGGGTGGATTGATTCAGATGGGATGTGTTTTCCAG 

AGCACAGAAGTGAAACACGTGACCAAGGTAGAATGGATATTTTCAGGACGGCGCGCAAAGGA 

GGAGATTGTATTTCGTTACTACCACAAACTCAGGATGTCTGTGGAGTACTCCCAGAGCTGGG 

GCCACTTCCAGAATCGTGTGAACCTGGTGGGGGACATTTTCCGCAATGACGGTTCCATCATG 

CTTCAAGGAGTGAGGGAGTCAGATGGAGGAAACTACACCTGCAGTATCCACCTAGGGAACCT 

GGTGTTCAAGAAAACCATTGTGCTGCATGTCAGCCCGGAAGAGCCTCGAACACTGGTGACCC 

CGGCAGCCCTGAGGCCTCTGGTCTTGGGTGGTAATCAGTTGGTGATCATTGTGGGAATTGTC 

TGTGCCACAATCCTGCTGCTCCCTGTTCTGATATTGATCGTGAAGAAGACCTGTGGAAATAA 

GAGTTCAGTGAATTCTACAGTCTTGGTGAAGAACACGAAGAAGACTAATCCAGAGATAAAAG 

AAAAACCCTGCCATTTTGAAAGATGTGAAGGGGAGAAACACATTTACTCCCCAATAATTGTA 

CGGGAGGTGATCGAGGAAGAAGAACCAAGTGAAAAATCAGAGGCCACCTACATGACCATGCA 

CCCAGTTTGGCCTTCTCTGAGGTCAGATCGGAACAACTCACTTGAAAAAAAGTCAGGTGGGG 

GAATGCCAAAAACACAGCAAGCCTTTTGAGAAGAATGGAGAGTCCCTTCATCTCAGC^^ 

TGGAGACTCTCTCCTGTGTGTGTCCTGGGCCACTCTACCAGTGATTTCAGACTCCCGCTCTC 

CCAGCTGTCCTCCTGTCTCATTGTTTGGTCAATACACTGAAGATGGAGAATTTGGAGCCTGG 

CAGAGAGACTGGACAGCTCTGGAGGAACAGGCCTGCTGAGGGGAGGGGAGCATGGACTTGGC 

CTCTGGAGTGGGACACTGGCCCTGGGAACCAGGCTGAGCTGAGTGGCCTCAAACCCCCCGTT 

GGATCAGACCCTCCTGTGGGCAGGGTTCTTAGTGGATGAGTTACTGGGAAGAATCAGAGATA 

AAAACCAACCCAJ^TCAA 
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FIGURE 304 

MFCPLKLILLPVLLDYSLGLNDLNVSPPELTVHVGDSALMGCVFQSTEDKCIFKIDOT^ 

EHAKDEYVLYYYSNLSVPIGRFQNRVHLMGDILCNDGSLLLQDVQEADQGTYICEIRLKGES 

QVFKKAVVIJ^VLPEEPKEL^4VHVGGLIQMGCVFQSTEVKHVTKVEWIFSGRRA 

HKLRMSVEYSQSWGHFQNRVNLVGDIFRNDGSIMLCySVRESDGGNYTCSIHLGNLVFKKTIV 

LHVSPEEPRTLVTPAALRPLVLGGNQLVIIVGIVCATILLLPVLILIVKKTCGNKSSVNSTV 

LVKNTKKTNPEIKEKPCHFERCEGEKHIYSPIIVREVIEEEEPSEKSEATYMTMHPVWPSLR 

SDRMNSLEKKSGGGMPKTQQAF 
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FIGURE 305 

CTATGAAGAAGCTTCCTGGAAAACAATAAGCAAAGGAAAACAAATGTGTCCCATCT 

GTTCTACCCTACTAAAGACAGGAAGATCATAAACTGACAGATACTGAAATTGTAAGAGTTGG 

AAACTACATTTTGCAAAGTCATTGAACrCTGAGCTCAGTTGCAGTACTCC^ 

GGATGAAGATGGATACATCACCTTAAATATTAAAACTCGGAAACCAGCTCTCGTCTCCGTTG 

GCCCTGCATCCTCCTCCTGGTGGCGTGTGATGGCTTTGATTCTGCTGATCCTGTGCGTGGGG 

ATGGTTGTCGGGCTGGTGGCTCTGGGGATTTGGTCTGTCATGCAGCGCAATTACCTACAAGA 

TGAGAATGAAAATCGCACAGGAACTCTGCAACAATTAGCAAAGCGCTTCTGTCAATATGTGG 

TAAAACAATCAGAACTAAAGGGCACTTTCAAAGGTCATAAATGCAGCCCCTGTGACACAAAC 

TGGAGATATTATGGAGATAGCTGCTATGGGTTCTTCAGGCACAACTTAACATGGGAAGAGAG 

TAAGCAGTACTGCACTGACATGAATGCTACTCTCCTGAAGATTGACAACCGGAACATTGTGG 

AGTACATCAAAGCCAGGACTCATTTAATTCGTTGGGTCGGATTATCTCGCCAGAAGTCGAAT 

GAGGTCTGGAAGTGGGAGGATGGCTCGGTTATCTCAGAAAATATGTTTGAGTTTTTGGAAGA 

TGGAAAAGGAAATATGAATTGTGCTTATTTTCATAATGGGAAAATGCACCCTACCTTCTGTG 

AGAACAAACATTATTTAATGTGTGAGAGGAAGGCTGGCATGACCAAGGTGGACCAACTACCT 

TAATGCAAAGAGGTGGACAGGATAACACAGATAAGGGCTTTATTGTACAATAAAAGATATGT 

ATGAATGCATCAGTAGCTGAAAAAAAAAAAAAA 
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FIGURE 306 

MQDEDGYITIJSIIKTRKPALVSVGPASSSWWRVMALILLILCVGMVVGLVALGIWSVMQRNYL 

QDENENRTGTLCXJIAKRFCQYVVKQSELKGTFKGHKCSPCDTNWRYYGDSCyGFFRHNLTWE 

ESKQYCTl)MNATIiKIDNRNIVEYIKARTHLIRWVGLSRQKSNEVWKWEDGSVISENMFEFL 
EDGKGNMNCAYFHNGKMHPTFCENKHYLMCERKAGMTKVDQLP 
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